How will EPA regulate the power sector?


As threats from climate change become more urgent, the US Supreme
Court has responded by erecting a new roadblock to effective climate
policy. Last month, it struck down the Clean Power Plan, the Obama
administration’s never-implemented regulation of greenhouse gas
emissions from existing power plants. The ruling [West Virginia v.
Environmental Protection Agency (EPA)] is a blow to climate action
and could signal the court’s hostility to a wide range of future
regulations within and beyond the climate and environmental sphere,
including those related to consumer protection and worker safety.
Although the immediate effects on US climate policy aren’t pervasive,
EPA now needs to evaluate the emissions-reduction potential and legal
risks of alternative regulatory approaches for the power sector.

The court’s decision didn’t question EPA’s authority to regulate
greenhouse gas emissions under the Clean Air Act. Similarly, it didn’t
limit authority to regulate greenhouse gas emissions from existing
power plants under section 111(d) of the Clean Air Act—the provision
used in the Clean Power Plan. The direct impact of the decision is
relatively narrow, foreclosing only one approach to power sector
climate regulation. This leaves EPA with other options.

The court objected to the Clean Power Plan’s reliance on “generation
shifting,” a partial shift in electricity production from coal to
natural gas, which generates fewer emissions, and from coal and natural
gas to renewables, which produce no emissions. According to the court,
the Clean Air Act didn’t authorize EPA to base its standard on this
technique. Climate regulations for other sectors, including
transportation, oil and gas, and manufacturing, are likely to be source
specific, and therefore unaffected by this decision.

With generation shifting now off the table, EPA must choose the “best
system of emission reduction” under the relevant provision in the Clean
Air Act. This system must be “adequately demonstrated,” which means
that it cannot be too speculative. And the agency must “take into
account the cost of achieving such reduction,” which means that costs
cannot be excessive. EPA could consider basing the “system” on
technologies such as carbon capture and storage (CCS). It could also
consider requiring the combustion of coal simultaneously with natural
gas at coal plants (cofiring), which reduces emissions by replacing
some coal with natural gas.


CCS, which captures and stores carbon dioxide before 
it is released into the atmosphere, will lead to the greatest 
emission reductions among the options remaining. Thus, 
EPA is compelled to adopt it as long as other statutory re- 
quirements are met. In 2017, a document filed in a federal 
litigation on greenhouse gas standards strongly argued 
that the technology was “adequately demonstrated,” and 
there have since been further technological develop- 
ments. The process is costly but the 2021 infrastructure 
bill provides substantial subsidies for its use, and more 
federal money is possible through future congressional 
action. EPA will need to evaluate whether the funding 
sufficiently lowers the effective cost of CCS. Because some 
states have higher subsidies for this technology than 
others, EPA could divide the power sector into regions 
and require CCS in states with larger subsidies. 

Should EPA conclude that the costs of universal CCS 
are excessive, it could require power plants with mul- 
tiple contiguous generating units to 
partially adopt CCS. For example, 
EPA standards could require plants 
with three or more contiguous 
units to use CCS at one-third of the 
units. This technique, called “bubbling,”
is a common regulatory tool
upheld by the Supreme Court. EPA 
could also evaluate whether the 
statute permits this approach for 
noncontiguous sources. 

As an alternative, the agency could require that 
coal plants cofire with natural gas. Because many coal 
plants have already adopted cofiring, it is clearly
“adequately demonstrated” and doesn’t require that “coal
plants...become natural gas plants”—which the Supreme
Court indicated would be problematic. If EPA
were concerned about the legality of an across-the- 
board cofiring standard, the agency could restrict this 
requirement to a subcategory of the industry that al- 
ready uses cofiring and set standards for other sources 
using a different approach. 

Generation shifting would have reduced emissions 
at the lowest cost through a cap-and-trade program or 
related regime, but that’s no longer possible. Unfortu- 
nately, other regulatory approaches for the power sector 
will be more costly or less effective. Moreover, these 
regulations, as well as other regulations in the climate 
change and environmental sectors, may be slowed or 
stalled on other grounds. The fallout from the Supreme 
Court’s decision across the economy is likely to have 
only just begun. 

-Richard L. Revesz 
The court is lost


The United States has an insatiable desire for
technological advancement but is governed 
by founding documents that are completely 
unsuited for science and technology. This in- 
congruity has manifested in recent disastrous 
actions by the US Supreme Court on guns, 
abortion, and climate. The decisions suggest 
that the battle is being won by the portion of America 
who—while lionizing the past and clinging to the in- 
fallibility of words written in the late 18th century— 
can’t put down their cell phones. Reactionary posts on 
social media wouldn’t get very far without a hundred 
years of technical advances—and massive amounts of 
power to recharge mobile device batteries and run the 
server farms that support the digital world. Because 
the disconnect between aspects of modern life and the 
framing of the country’s governance appears inconse- 
quential to the conservative majority of justices in the 
US Supreme Court, it is vital that 
the scientific community advocate 
a political and societal landscape in 
which compassion and adaptability 
attend technological progress. 

Some of the most compelling 
writing on this topic has come 
from Harvard historian Jill Lepore. 
Lepore wrote a prescient Science 
editorial before the 2020 US presi- 
dential election in which she made 
the case that accurately counting 
the votes in the election was more 
important than a quick tally. In her 
sweeping and compelling history of 
the United States, These Truths, Lepore chronicles the 
political impacts of America’s obsession with techno- 
logical progress that began with the industrial revolu- 
tion and continued through space exploration, nuclear 
weapons, and the internet. Science and technology 
marched ahead against a backdrop of devotion to a set 
of founding documents that didn’t contemplate any of 
these advances. 

“The Constitution establishes fundamental law, the 
law that governs the government,” Lepore told me. “At 
the time it was written, in 1787, women couldn’t run 
for office, or vote; they were shunned even for express- 
ing political views. At the first federal census in 1790, 
nearly one in six people were held as property. Bearing 
‘arms’ at the time the Bill of Rights was ratified, in 
1791, meant, mostly, hauling around a lousy old mus- 
ket that took ten minutes to load in hopes of shooting 
a rabbit for supper.” 




The court’s recent spate of bad decisions mocks scien- 
tific facts. It obviated a modest and reasonable gun con- 
trol law in New York despite research showing that gun 
control saves lives. It removed a woman’s right to abor- 
tion even though the Turnaway Study and others like it 
show that access to abortion improves health. They at- 
tributed their rulings to fealty to a document written by 
slaveowners who had no idea about automatic weapons 
and who didn’t even think women should participate in 
the government. 

The recent decision on the Environmental Protection 
Agency (EPA) is a body blow for environmental science. 
It struck down the implementation of a completely 
reasonable plan that allowed the agency to regulate 
greenhouse gas emissions from existing power plants. 
The court objected to the fact that the plan would have 
shifted production from coal to natural gas. Appar- 
ently, they couldn’t find any support in the Constitu- 
tion for environmental regulation. 
The notion that the Constitution 
would contemplate climate change 
is ridiculous. As Lepore pointed out, 
“The average annual temperature in 
Philadelphia was fifty-two degrees 
Fahrenheit. By 1962, the year Rachel 
Carson published Silent Spring, it 
was fifty-four; by 2017, the year the 
Trump Administration withdrew the 
United States from the Paris Climate 
Accords, it was fifty-nine.” 

The scientific enterprise has con- 
tributed to the tension between the 
conservative court and a developed 
society by enjoying the psychic and monetary rewards 
for its technological achievements without demanding 
social responsibility in return. Science must realize that 
progress is more than just filling journals with knowl- 
edge and hoping for the best. The scientific community 
must value and partner with communicators and policy- 
makers who can help show that scientific advancement 
demands that the nation operate as a work in progress. 
Otherwise, America will be stuck with a government 
that worships a set of documents created by men who 
had no idea about evolution, dinosaurs, hydrocarbons, 
women’s health, or digital communication. “There’s a 
great deal to be said for constitutional stability, for re- 
lying on the constitution as the expression of a set of 
foundational principles,” said Lepore. “But reading it as 
a Bible for solving the problems of the 21st century? The 
shortcomings are overwhelming.” 

-H. Holden Thorp 


H. Holden Thorp 
Editor-in-Chief, 
Science journals. 
hthorp@aaas.org;
@hholdenthorp 
ASTRONOMY


New telescopes will rapidly 
scan the sky to facilitate 
timely observations of 
areas where gravitational 
waves originated. 


Speedy scopes to spy gravitational wave sources 


Researchers last week reached the midpoint in
building a pair of observatories designed to pin- 
point the location of cataclysmic events sensed by 
gravitational wave detectors so that other astron- 
omers can quickly zoom in on the aftermath. The 
Gravitational-wave Optical Transient Observer 
(GOTO) uses two sets of 16 small telescopes, one in the 
Canary Islands—now operational—and one in Australia, 
whose construction has just started. They will swing into 
action automatically when gravitational wave detectors, 
such as the Laser Interferometer Gravitational-Wave 
Observatory (LIGO) in the United States and Virgo in 
Italy, register a space-time ripple caused by events such
as a merger of two black holes. Such detectors don’t give
precise locations, so GOTO’s scopes will sweep the region 
of space for rapidly brightening objects that could be the 
source of the wave; operators will then send alerts, giv- 
ing more sensitive optical telescopes a location to aim at. 
A neutron star merger, spotted gravitationally in 2017, 
produced visible signals, but telescopes took 11 hours to 
find them after the event became known, and missed 
the explosion’s early phases. The £4.4 million GOTO 
telescopes hope to do better and flag candidates in half 
an hour. Project leaders aim to be ready by March 2023, 
when upgrades to LIGO and Virgo are finished and they 
begin their next observing runs. 


Monkeypox emergency declared 


INFECTIOUS DISEASES | The World Health 
Organization (WHO) last week declared 
the global monkeypox outbreak, which 
has sickened more than 15,000 in at least 
70 countries, a Public Health Emergency 
of International Concern (PHEIC), even 
though a sharply divided advisory com- 
mittee had not recommended doing so. In 
June, WHO’s Emergency Committee for 
monkeypox first advised against giving 
the epidemic PHEIC status—which grants 
WHO extra powers and helps focus politi- 
cal attention on an outbreak—a decision 
that was widely criticized. At its second 
meeting on 21 July, the panel could not 
reach a consensus, with nine members 
opposing a PHEIC declaration and six 
supporting it. (The meeting was followed 
by tense exchanges between participants
via email and text messages, Science has 
learned.) Opponents noted that monkeypox 
is not yet circulating in the general popula- 
tion; it has been observed mostly in men 
who have sex with men. But others argued 
for action now because there is a danger 
the virus could become established long- 
term in the wider population. On 23 July, 
WHO Director-General Tedros Adhanom 
Ghebreyesus said the outbreak met criteria 
in the International Health Regulations 
and, in an unprecedented move, declared a 
PHEIC anyway. 


NASA delays Moon rover launch 


LUNAR SCIENCE | The first NASA robotic 
rover to visit the Moon will be delayed by 
a year, launching in November 2024, the
agency announced last week. The Volatiles
Investigating Polar Exploration Rover, 
known as VIPER, will land at the Moon’s 
south pole, hunting for water ice. The 
delay, which will nudge the $754 million 
mission closer to the planned launch by 
China of a similar device, will allow further 
testing of the lunar lander, developed by 
Astrobotic Technology, that will carry the 
rover in the lander’s first lunar touchdown. 


‘Plan B’ for U.K. science funding 


BREXIT | The U.K. government last week 
outlined a domestic program to replace 
funding that U.K.-based researchers would 
have had access to from the European 
Union’s flagship research scheme. The EU 
has avoided finalizing a deal to allow U.K. 
participation in the 7-year, €95 billion 
Horizon Europe program because of
ongoing disputes over trade in Northern 
Ireland. The United Kingdom’s “Plan B” 
would kick in if the Horizon deal com- 
pletely falls through. U.K. researchers could 
continue to participate in Horizon-funded 
international collaborations that include at 
least three partners in EU member states 
or other Horizon-associated countries. EU 
rules would require these U.K. researchers 
to bring their own funding to the table, 
and Plan B would put no immediate limits 
on funding such consortium participation. 


Trump policy on species reversed 


CONSERVATION | Completing a review 
President Joe Biden promised on his first 
day in office, the U.S. Fish and Wildlife 
Service (FWS) last week finished rolling 
back changes made to the Endangered 
Species Act by former President Donald 
Trump’s administration. One of Trump’s 
changes weakened a provision that governs 
when property owners may damage criti- 
cal habitat needed by endangered species 
to survive. The Trump rule would have 
required FWS to allow landowners to do 
such damage—by filling wetlands, for 
example—if they showed credible evidence 
that continuing to protect the critical habitat
would cause economic harm. On 21 July,
the Biden administration reinstated
a previous rule that allowed FWS
to use its discretion on whether to grant 
such requests. In June, the agency undid a 
separate Trump-era decision that raised the 
bar for designating land as critical habitat. 


Surgeon to head cancer institute 


LEADERSHIP | President Joe Biden is 
expected to pick cancer surgeon Monica 
Bertagnolli as the next director of the 
National Cancer Institute (NCI), the 
world’s largest funder of cancer research. 
Bertagnolli, a physician-scientist at 
Brigham and Women’s Hospital, the 
Dana-Farber Brigham Cancer Center, and 
Harvard Medical School, specializes in 
gastrointestinal cancers and is well known 
for her expertise in clinical trials. Her lab 
research on tumor immunology and the 
role of a gene called APC in colorectal 
cancer led to a landmark trial she headed 
showing an anti-inflammatory drug can 
help prevent this cancer. Bertagnolli, 63, 
will be the first woman to lead NCI. 


A head for U.K. ‘high-risk’ funder


LEADERSHIP | Ilan Gur, an entrepreneur and materials scientist, will
lead the United Kingdom’s new agency to fund outside-the-box research
leading to commercial spinoffs. The government this year created the
Advanced Research and Invention Agency (ARIA), modeled on the U.S.
Defense Advanced Research Projects Agency (DARPA) and its sponsorship
of “high-risk, high-reward” science. In February, ARIA selected DARPA’s
deputy director, Peter Highnam, as its head, but he withdrew before
starting the job. Gur worked as a program officer at ARPA-E, the U.S.
Department of Energy’s version of DARPA. He also founded the U.S.
nonprofit Activate, which has supported the creation of more than 100
science-based startups, according to a government statement. To
encourage innovative grantmaking, ARIA’s £800 million, 4-year budget
will be separate from that of the country’s main research funder, UK
Research and Innovation; the sum amounts to only 1% of total R&D
spending. Unlike DARPA and ARPA-E, ARIA so far lacks a specific
research focus, which some observers have predicted will undermine its
effectiveness.


The National Museums of Kenya, in The
Star, on the death of Kamoya Kimeu, 81,
a Kenyan who found many important hominin
fossils including “Turkana Boy,” an almost
complete Homo erectus skeleton, in 1984.


PALEONTOLOGY


Oldest known relative of living animals revealed


A fossil of an organism between 556 million and 562 million years old
bears features resembling those of modern jellyfish and coral, and
appears to be the oldest example of an evolutionary group still living
today. Paleobiologist Philip Wilby and colleagues at the British
Geological Survey discovered the new fossil in the Charnwood Forest, a
hotbed of pre-Cambrian paleontology in central England. The organism
predates the Cambrian explosion of about 539 million years ago, which
scientists historically accepted as the origin of modern animals. The
specimen, Auroralumina attenboroughii (artist’s reconstruction, above),
belongs to a group called cnidarians, which includes today’s jellyfish
and corals, the research team concludes this week in Nature Ecology &
Evolution. The symmetry of its body is like that of modern jellyfish.
The team selected its name in part to honor naturalist and broadcaster
David Attenborough, who spent his childhood near Charnwood and has
brought attention to its fossils.
IN DEPTH


HEALTH 


Surprise virus tied to pediatric hepatitis cases


Two viruses plus a child’s genetic background may explain 
a recent surge in the United Kingdom 


By Jocelyn Kaiser 


Starting in April, young children
with mysterious cases of acute
liver inflammation began to attract
scientific—and media—attention in
the United Kingdom and elsewhere.
Most would recover, some after liver
transplants, but a few died. Physicians 
could find no evidence in the children of 
viruses that typically cause hepatitis, but 
researchers homed in on an unexpected 
suspect: adenovirus, a family of cold- 
causing viruses common in kids. Now, a 
sweeping genetic search of these patients 
instead implicates another virus, one pre- 
viously thought to be completely harmless. 
In two independent, preliminary stud- 
ies, U.K. researchers found high adeno- 
associated virus 2 (AAV2) levels in the 
blood or liver cells in 24 of 25 children 
with unexplained hepatitis. In a group of
kids without this condition, almost none 
had AAV2, even those with adenovirus. 
The young hepatitis patients were also 
much likelier to have a genetic variant that 
may make their immune systems overreact 
to viruses. 

The preprints, one posted on medRxiv
on 19 July and the other just submitted
there, offer new clues to the mystery
pediatric hepatitis, which now numbers
more than 1000 probable cases in 35 countries
and has caused at least 22 deaths.

In March, 3-year-old Lola-Rose Raine of Gravesend,
England, received a liver transplant from her father
after she developed severe, unexplained hepatitis.

But outside researchers are cautious 
about the studies, which haven’t been 
peer reviewed. “It’s intriguing for sure, 
but it’s a very small number of cases and 
controls. It could be associations and not 
causation,” says Saul Karpen, a pediatric 
gastroenterologist at Emory University 
School of Medicine. 

Severe hepatitis in children is rare and 
often goes unexplained. But alarm bells 
went off in the spring after hospitals in the 
United Kingdom reported a rise in cases— 
now up to 272—that are not linked to the 
usual causes, such as a hepatitis A, B, or C 
virus. Another 334 such unexplained cases 
have been reported in the United States, 
but the Centers for Disease Control and 
Prevention said in June that these are not 
above usual levels. 

Still, a spring spike in adenovirus infec- 
tions in the United Kingdom lent credibil- 
ity to a hepatitis surge there. And last week 
in two papers in The New England Journal 
of Medicine, teams in Birmingham, Eng- 
land, and Birmingham, Alabama, reported 
adenovirus 41, a specific type that causes 
gastrointestinal illness, in the majority of 
nine U.S. and 30 U.K. cases tested. 

But the London and Glasgow, Scotland, 
teams behind the two preprints wanted 
to make sure some new virus or viral 
variant wasn’t emerging. They used DNA 
and RNA sequencing of tissue samples 
from U.K. cases to look for about 200 dif- 
ferent virus families. AAV2 jumped out. 
It was in liver or blood samples in all 
nine Scottish cases examined and 15 of 
16 separate cases from all over the United 
Kingdom, according to the studies. In 
contrast, almost none of 158 controls, 
which included healthy children and chil- 
dren with hepatitis for known reasons, 
had AAV2. 

The viral family to which AAV2 belongs 
was initially discovered in samples that 
contained adenoviruses—AAVs need a pro- 
tein from a distant viral relative, usually 
adenovirus or herpesvirus, to replicate. 
Many people are infected by AAV2 by age 
10, but the virus can lie dormant in cells 
until that helper virus comes along and 
activates it. 

Most of the patients also had adeno- 
virus, usually the 41 type. But AAV2 never 
turned up in a control group of 12 Scottish 
children who had an adenovirus infec- 
tion but no hepatitis. The authors of the 
preprints also ruled out SARS-CoV-2, an 
early suspect, partly because its infection 
rates weren’t higher than average in the
Scottish hepatitis patients. 

The pandemic could, however, indirectly 
explain the rise of pediatric hepatitis in the 
United Kingdom. Relaxed COVID-19 restric- 
tions meant children may have been exposed 
to a “cocktail of viruses” all at once versus 
gradually, according to Emma Thomson, an 
infectious diseases specialist at the MRC- 
University of Glasgow Centre for Virus Re- 
search and a leader of the Scottish study. 

Both studies also found that all but two of 
14 sick children tested carried a specific ver- 
sion of a type of gene called HLA that helps 
shape the body’s response to pathogens. The 
variant is particularly common in northern 
Europeans—16% of Scottish people carry it, 
and it is known to be linked to some auto- 
immune disorders, Thomson notes. 

Although the researchers found genetic
material from AAV2 in patients’ liver cells,
they did not detect viral proteins or actual
copies of the virus. That suggests that, instead
of directly damaging liver cells, AAV2 may
provoke an immune response that harms
the organ. That theory is supported by the
link with HLA type and by the fact that some
kids had gastrointestinal illnesses many weeks
before they developed hepatitis, said a leader
of the London team, virologist Judy Breuer 
of Great Ormond Street Hospital and the 
University College London Great Ormond 
Street Institute of Child Health, at a press 
briefing Monday. 

A pathogenic role for AAV2 “goes against 
everything we know” about the virus, says 
virologist Eric Kremer of the Institute of 
Molecular Genetics of Montpellier. Looking
at the type of antibodies to AAV2 in the
children, he notes, could support its guilt
by distinguishing between a long-ago infection
and a more recent one timed with
the onset of hepatitis. Those studies are
planned, Thomson says. 

The good news for U.K. parents in all 
this: Pediatric hepatitis cases have dropped 
in recent weeks to “almost ... background,” 
said Meera Chand of the UK Health Secu- 
rity Agency, which expects to release its 
own study of adenoviruses in these cases 
this week. 

At the same time, the studies raise ques- 
tions about delivering therapeutic genes 
with AAVs, as many groups are trying to, 
Kremer notes. Although the viruses are 
further modified so they can’t replicate 
even if a helper virus is around, they have 
sometimes caused liver inflammation— 
which in rare cases may have contributed 
to deaths. Stanford University gene ther- 
apy researcher Mark Kay says the field may 
want to explore whether HLA type can pre- 
dict whether an AAV gene therapy patient 
will experience liver toxicity. 
SCIENTIFIC COMMUNITY


NSF grant decisions reflect 
systemic racism, study argues 


Success rates for white scientists far exceed the NSF 
average, whereas Black and Asian researchers do worse 


By Jeffrey Mervis 


White scientists are more likely to win
a grant from the National Science
Foundation than researchers from
other racial and ethnic groups, according
to an independent analysis
of more than 2 decades of NSF data
on its merit review process. 

The analysis supports earlier studies find- 
ing similar racial disparities in the funding 
of scientists by other federal agencies, nota- 
bly the National Institutes of Health (NIH). 
And its authors—a team led by geochemist 
Christine Yifeng Chen, a postdoc at Lawrence 
Livermore National Laboratory—attribute 
the gap in NSF funding 
rates, with white scientists 
at the top and Asian re- 
searchers at the bottom, to 
“systemic racism.” 

The NSF funding disparities “have cascading impacts that perpetuate a
cumulative advantage to White [principal investigators] across all of
science, technology, engineering, and mathematics,” they write in their
study, posted earlier this month on the Center for Open Science
preprint site.

The team gave a copy of its analysis to NSF leadership, which is not
challenging its conclusions. NSF Director Sethuraman Panchanathan
“shares these concerns [about] systemic racial disparities in funding
at NSF and other federal agencies,” an agency spokesperson says.

The researchers delved into detailed annual NSF reports that contain
data on 1 million proposals submitted to the agency between 1996 and
2019. The reports mention racial disparities only in passing, but Chen
and her team focused on them after hearing complaints from senior
nonwhite colleagues about what they felt was an uneven playing field in
winning NSF grants.

“I think it’s significant that this project was initiated by
early-career scientists,” Chen says. “It speaks to the prevailing
culture in academia that allows the status quo to be perpetuated. We
felt that if we didn’t do the analysis, nobody else would.”

Every scientist submitting a proposal to NSF faces stiff competition;
overall success rates fluctuated between 22% and 34% over the study
period. But white scientists consistently did better, Chen and
colleagues found. (Scientists are asked to voluntarily provide their
race and ethnicity when applying for a grant.)

In 2019, for example, NSF funded 31.3% of proposals from white
scientists, versus an overall rate of 27.4%. In contrast, the success
rate was 22.4% for Asian scientists and 26.5% for Black scientists.
Proposals from Latino scientists were funded 29% of the time, a rate
slightly above average but below the rate for white scientists.

Chen and colleagues translate the higher success rates into what they
call “surplus awards.” In 2019, when NSF received about 42,000
proposals, the team calculated that white scientists received 798
surplus grants. The cumulative surplus over 20 years was 12,820 awards
(see table, below).

Funding issue
A study of racial disparities in National Science Foundation funding
concludes that white researchers reaped a large cumulative “surplus” of
awards between 1999 and 2019, whereas Asian applicants experienced
significant “underfunding,” with other groups falling in between.

Group                               Cumulative awards
White                               +12,820
American Indian/Native Alaskan      +80
Native Hawaiian/Pacific Islander    -17
Hispanic/Latino                     -175
Black/African American              747
Asian                               -9701

In contrast, Asian scientists received 460 fewer awards in 2019 than
they would have had their success rate been comparable, with a
cumulative “unfunded” total of 9701 awards. (Asian scientists submit
the second most proposals of the various groups every year,
roughly half the number sent in by white
scientists.) The relative advantage for white
scientists has steadily grown, the analysis
shows, from 3% above
the agency’s average success rate in 1999 to
14% above in 2019.
For Black scientists, the funding gap over 
that period was smaller, but still significant. 
(Black researchers submit one proposal for
every 20 proposals NSF receives from white 
scientists.) The average funding rate for Black 
scientists was 8 percentage points below that 
of white scientists, according to the pre- 
print’s authors. Pacific Islander and Native 
Hawaiian scientists, an even smaller cohort 
of applicants, had success rates 11 percentage 
points below that of white scientists. In con- 
trast, Latino scientists—who submit about 
50% more proposals to NSF than do Black 
scientists—have done slightly better than the 
norm but were 2 percentage points below the 
success rate for white scientists in 2019. 
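
The 2019 success rates quoted above can be compared in two ways: as a difference in percentage points, or as a difference relative to the overall rate. The short sketch below works through both using only the rates given in the article. The `surplus_awards` helper is a hypothetical illustration of how a rate gap could be converted into a count of awards; the study's own method and the per-group proposal counts are not given here, so the proposal count used with it is an assumption.

```python
# 2019 success rates as quoted in the article (percent of proposals funded).
OVERALL_RATE = 27.4
RATES_2019 = {"White": 31.3, "Asian": 22.4, "Black": 26.5, "Latino": 29.0}


def gap_points(rate, baseline=OVERALL_RATE):
    """Absolute gap from the baseline, in percentage points."""
    return rate - baseline


def gap_relative(rate, baseline=OVERALL_RATE):
    """Gap from the baseline, as a percentage of the baseline rate."""
    return 100.0 * (rate - baseline) / baseline


def surplus_awards(n_proposals, rate, baseline=OVERALL_RATE):
    """Hypothetical helper: awards above (positive) or below (negative)
    what the same number of proposals would earn at the baseline rate."""
    return n_proposals * (rate - baseline) / 100.0


if __name__ == "__main__":
    for group, rate in RATES_2019.items():
        print(f"{group:7s} {gap_points(rate):+5.1f} points  "
              f"{gap_relative(rate):+6.1f}% relative")
    # Assumed proposal count, for illustration only.
    print(surplus_awards(1000, RATES_2019["White"]))
```

For white scientists the gap works out to roughly 3.9 percentage points, or about 14% relative to the overall rate, which shows how much the choice of unit changes the headline number.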

About 75% of the proposals NSF receives 
each year are classified as research propos- 
als. The remaining requests are to support 
education and training, equipment and 
facilities, conferences, and other activities. 
White scientists enjoy an even larger ad- 
vantage over most other groups in winning 
research awards, the study found. For non- 
research awards, most nonwhite groups did 
better than the NSF-wide average. 

In 2011, a team led by University of Kan- 
sas, Lawrence, economist Donna Ginther 
found similar racial disparities among NIH 
grant recipients, including a gap of up to 
13 percentage points in success rates between
white and Black scientists. That gap
shrank—but did not disappear—in follow-up
studies that accounted for such factors
as an applicant’s publication history, prior 
funding, age, academic rank, and how much 
research takes place at their institution. 

Even so, former NIH Director Francis 
Collins last year apologized to “individuals 
in the biomedical research enterprise who 
have endured disadvantages due to struc- 
tural racism.” NIH has tried to reduce dis- 
parities with programs designed to increase 
the success of Black applicants. 

Unlike NIH with Ginther, NSF did not 
give Chen’s team access to applicant data 
that would have allowed it to do such a 
multivariate analysis. “That information 
would have been extremely valuable for 
looking at issues of intersectionality,” says
co-author Aradhna Tripati, a geoscientist 
at the University of California, Los Angeles. 
“We also could have looked at the impact 
of NSF’s existing programs to foster equity 
and broader participation in science.” 

Even without that extra level of analy- 
sis, the study’s findings appear sound, says 
Susan White, director of the statistical re- 
search center at the American Institute of 
Physics. “It may not be exactly the number 
[the authors] present, but I don’t doubt that 
the disparity is real, and serious,” she says. 

An NSF spokesperson says although the 
agency is proud of its array of programs 
designed to address equity and inclusion, 
“there is still much [work] to do.” 
HUMAN EVOLUTION


Ancient Europeans farmed 
dairy—but couldn't digest milk 


Giant study of ancient pottery and DNA challenges 
common evolutionary explanation for lactase persistence 


By Cathleen O’Grady 


Over the past 10,000 years, populations
living far apart in Europe, Africa,
South Asia, and the Middle East separately
acquired a key genetic change:
the ability to digest the milk sugar
lactose as adults. Researchers thought
people who had that ability and lived in dairy 
farming cultures got a nutritional boost and 
had more children, thus spreading the ge- 
netic changes. 

But in recent years, unexpected findings— 
such as data from Mongolia, where people 
devour milk products 
but 95% of adults are 
genetically lactose 
intolerant—have challenged
that story. Now,
a study combines large 
archaeological data sets 
on dairy farming with 
ancient DNA and finds 
that across Europe, peo- 
ple consumed dairy for 
millennia before lactase 
persistence into adult- 
hood was widespread. 
The researchers suggest
illness and famine
may have turned lactose 
intolerance from un- 
comfortable to deadly, 
driving periods of intense selection for the 
digestive trait. 

The study “changes our long-term under- 
standing of the relationship between milk 
use and lactase persistence,” says Jessica 
Hendy, an archaeologist at the University of 
York who was not involved in the work. 

In the new study, archaeologists compiled 
evidence of milk from nearly 7000 pieces of 
ancient pottery, taken from 554 European 
sites representing the past 9000 years. They 
tracked the rise and fall of dairy farming 
across Europe by analyzing the fats preserved 
in the pottery. With ancient DNA specialists, 
they then compared this with signs of lac- 
tase persistence in 1293 published human 
genomes from the same regions and period. 

Fluctuating dairy use over time didn’t
match up with changes in lactase persistence.
Instead, the researchers found that
what they considered signals of famine and
sickness best matched the jumps in lactase
persistence in ancient DNA, they report in
Nature this week. (They used archaeological
records to identify periods of shrinking
populations—perhaps famines—as well
as times of greater population density—
possibly times of faster disease spread.)

A farmer in Mongolia, where most people
are lactose intolerant, milks a cow.

Lactose intolerance in dairying cultures 
might be dangerous for people who were 
sick or starving, suggests co-author Mark 
Thomas, a human evolutionary geneticist at 
University College London. A lactose intol- 
erant person consuming 
milk normally suffers 
flatulence and diarrhea, 
with no more severe ef- 
fects than embarrass- 
ment and discomfort, 
Thomas says: “But if 
you get diarrhea when 
you're severely malnour- 
ished, then you have 
serious problems. One 
of the biggest causes 
of death in the world 
is fluid loss in severely 
malnourished people.” 

The findings support 
the idea that dairy farm- 
ing alone wasn’t the key 
force behind the spread 
of lactase persistence, the researchers say: 
The selection pressure likely only grew strong 
when combined with sickness and starvation. 

It’s an “exciting avenue” for ongoing 
research, Hendy says. But she cautions 
it’s difficult to estimate ancient popula- 
tion fluctuations and understand what led 
to them. 

The research complements previous re- 
sults, such as the puzzlingly late arrival of 
widespread lactase persistence in Central 
Europe, says Christina Warinner, a molec- 
ular archaeologist at Harvard University. 
But she says the new study brings the heft 
of several large data sets to the question. 
The story of dairy farming has been “full of 
surprises,” Warinner says. “It’s helping us 
to really appreciate better the complexities 
of the past.” 
Radio bursts from ‘zombie’
black holes excite astronomers 


Delayed emissions from black holes that fed on stars earlier 
could help explain the formation of powerful jets 


By Zack Savitsky 


When a hapless star ventures too
close to one of the supermassive
black holes that lurk at the center
of galaxies, it’s torn to shreds and
stretched like spaghetti. In this
so-called tidal disruption event
(TDE), the black hole dines on the stellar 
remnants, which wrap around the black 
hole’s belly in an accretion disk. During the 
feast, the black hole can glow brighter than 
a supernova for months, before returning to 
a quiet state of hibernation. 

Or so the story usually goes. 

Continued monitoring by patient astron- 
omers has now revealed a few cases in which 
black holes wake up and belch matter and 
energy, sending bursts of radio waves toward 
Earth months or even years after the initial 
TDE. “What’s incredibly unusual about [these 
events] is that the objects came back to life, 
like a zombie,” says Enrico Ramirez-Ruiz, a 
theoretical astrophysicist at the University 
of California, Santa Cruz. “This is really chal- 
lenging the paradigm.” 

Astronomers aren’t sure what’s triggering 
the delayed outbursts, but they think the 
emissions could help explain the mysterious
mechanisms by which black holes convert
infalling stellar material into powerful
jets that rocket out from their poles. “It’s 
telling us something about the physics of 
the central engine that’s otherwise hidden 
from us,” says Sasha Tchekhovskoy, a com- 
putational astrophysicist at Northwestern 
University. “These jets can explode entire 
galaxies, so it’s a really important process in 
the evolution of galaxies.” 

Most of the few dozen known TDEs have 
been detected from the optical light or x-rays 
emitted in the initial feast. But, “Radio is now 
playing a very important role” in understand- 
ing TDEs, says astronomer Igor Andreoni of 
the Joint Space-Science Institute. Black holes 
generate radio waves by expelling plasma— 
pumping it out in polar jets or belching out 
material that crashes into surrounding gas. 
But these outflows normally take place dur- 
ing a TDE, shortly after the black hole rips 
apart its meal. 

In February 2021, however, Assaf Horesh, 
an astrophysicist at the Hebrew University 
of Jerusalem, discovered a radio burst that 
came 6 months after the initial TDE. Then, 
on 30 June, Yvette Cendes, an astronomer at 
the Harvard & Smithsonian Center for Astro- 
physics, reported finding another delayed 
flare in a preprint posted to arXiv. Using multiple
observatories, she and her colleagues
documented a rapid spike in radio activity
that launched more than 2 years after the
black hole’s initial snack. “It’s a pretty exceptional
case,” Cendes says.

Supermassive black holes can produce powerful jets
when shredding nearby stars, as seen in this artist’s
conception. Now, astronomers are seeing secondary
bursts months or years after the fact.

Horesh’s graduate student Itai Sfaradi may 
have caught a third example. Reanalyzing 
a previously spotted TDE, Sfaradi claims in 
the 10 July issue of The Astrophysical Jour- 
nal that he found delayed radio emissions 
in combination with an x-ray flare. These 
tandem emissions are sometimes seen in so- 
called x-ray binaries—in which star-size black 
holes suck gas away from a paired star— 
hinting that the mechanisms may be related, 
Horesh says. 

Shifts in the black hole’s accretion disk 
power the flare-ups from x-ray binaries, and 
Ramirez-Ruiz thinks the same thing may 
be happening with the supermassive black 
holes, months after a TDE. In this scenario, 
a star’s spaghettified gas piles up more slowly 
over time, causing the accretion disk to grow 
colder and thinner. Eventually, the disk weak- 
ens enough to open an escape path that al- 
lows the black hole’s magnetic field lines to 
launch material from the disk into space, 
where it crashes into surrounding gas and 
produces radio bursts. 

Tchekhovskoy agrees—and he’s got models 
that demonstrate the behavior. He and col- 
leagues ran computer simulations of accre- 
tion disk evolution and found they can reach 
a Goldilocks state in which jets can form effi- 
ciently. The key moment comes when the ac- 
cretion disk is still dense enough to fuel jets, 
but not so dense as to reabsorb the generated 
radio waves. 

Perhaps that’s why we're seeing these de- 
layed bursts, he says—“We're just waiting for 
the gas to have the right density.” 

More clues could come if wide-field radio 
surveys can capture other zombie awaken- 
ings. The Very Large Array, a complex of tele- 
scopes in New Mexico, is set to scan the skies 
for the third time next year, and the Austra- 
lian Square Kilometre Array Pathfinder will 
launch a full-sky survey later this year. Both 
Cendes and Horesh plan to conduct follow- 
up radio surveys of TDEs using these obser- 
vatories, among others. In unpublished work, 
Cendes has already found what she thinks 
are several more candidates. 

Discovering a larger population of these 
TDEs with delayed outbursts would unlock 
a natural laboratory, enabling theorists to 
investigate black hole behavior under a wide 
range of conditions, Ramirez-Ruiz says. For 
physicists, he says, “Black hole gastronomy 
really provides a new playground.” 
BIODIVERSITY


The most unusual birds are 
also the most at risk 


Two studies predict homogenization of the avian 
world as climate change and other human impacts continue 


By Elizabeth Pennisi 


It’s bad enough that Earth could be losing
thousands of species each year. Now, two 
independent studies of birds have con- 
cluded the ones most likely to disappear 
are those that serve unique—and possibly 
irreplaceable—functions in their eco- 
systems. Consider the toucan: Its iconic beak 
lets it eat and disperse seeds and fruit too 
large for other birds in South American rain- 
forests. Yet these striking creatures, as well as 
vultures, ibises, and others with distinctive 
physical traits, are likely to be the first to go 
extinct, homogenizing the avian world, ac- 
cording to one study. A second predicts com- 
munities will grow more alike as the climate 
changes and species flock to cooler regions. 

“That’s alarming because we know that 
diversity of sizes and shapes and behaviors 
is a signature of a healthy community,” says 
Scott Edwards, an evolutionary biologist at 
Harvard University. “This is laying out the 
grim world we are going to be facing.” 

He and others hope the studies will spur 
conservationists to think more broadly
about what biodiversity means. “The im- 
pacts of human actions can actually be 
worse than what we might think just based 
on species tallies alone,” says Jedediah 
Brodie, an ecologist at the University of 
Montana, Missoula. “We haven’t focused on 
what kinds of functions we might be losing,”
adds macroecologist Marta Jarzyna of
Ohio State University. 

Every ecosystem depends on diverse or- 
ganisms to fill a variety of roles. Among birds, 
some eat and disperse seeds whereas others 
eat carrion, helping recycle remains. Special 
traits aid these tasks: Pointed beaks help vul- 
tures tear into flesh whereas long legs keep 
wading birds’ bodies dry. “When communi- 
ties are homogenized, they lose a lot of those 
ecological functions,” Brodie says. 

To assess whether birds with certain sets 
of traits are disappearing, Emma Hughes, a 
newly minted Ph.D. at the University of Shef- 
field, spent several years measuring the beak 
size and shape, lower limb and wing length, 
and body size of museum specimens of almost 
8500 birds from around the world. Working 
with conservation biologist David Edwards 
and macroecologist Gavin Thomas, both also 
at Sheffield, she used statistical techniques 
to chart similarities and differences among 
the species based on these traits. Many birds, 
such as songbirds, clustered together in 
shape. Outliers included big albatrosses and 
tiny hummingbirds. 

Next, the researchers began to pare down 
this chart by removing the birds most likely to 
go extinct, using the International Union for 
Conservation of Nature’s Red List of Threat- 
ened Species, which ranks species accord- 
ing to their probability of disappearing. The 
most threatened species also turned out to be
the most distinctive in body shape and ecosystem
function, the group reported last week
in Current Biology. As the researchers sequentially
removed species from most to least
threatened, toucans, ibises, and other distinctive
species dropped out; similar birds—think
finches and starlings—remained.

Birds of distinctive form and specialized ecological
niches, such as these Himalayan griffon vultures in
China, may be more likely to go extinct.

To identify regions where the trend might 
hit hardest, the Sheffield team analyzed birds 
in 14 major habitats, such as tropical grass- 
lands. They found that species homogeniza- 
tion will affect 12 of the 14 and will be most 
extreme in flooded grasslands and tropical 
forests. The “most imperiled” regions in- 
cluded southern Vietnam, Cambodia, and 
the Himalayan foothills, as well as islands 
such as Hawaii, which has already lost all its 
distinctive honeycreepers, Hughes notes. “In 
some cases, there are no other organisms that 
can replace the unique ecological roles that 
these species play,” Brodie says.

A separate study led by Alke Voskamp, 
a macroecologist at the Senckenberg Bio- 
diversity and Climate Research Centre, 
identified another homogenizing force: 
climate-driven shifts in birds’ ranges. Her 
European team mapped the current ranges 
of 9882 bird species compiled by BirdLife 
International. Then they applied projec- 
tions from global climate models to predict 
where those species could find hospitable 
habitat through 2080, and analyzed how 
the altered distributions would change the 
makeup of bird communities. “[Moving] re- 
shuffles the communities,” Voskamp says. 

As expected, tropical and subtropical re- 
gions will likely lose the most species as 
they either go extinct or shift ranges. A few 
species will move in, but most are expected 
to be closely related to one another and to 
have similar traits, Voskamp and colleagues 
reported on 19 July in the Proceedings of the 
Royal Society B. 

Northern North America and Eurasia will 
gain species as birds migrate away from 
warmer regions. But there, too, many of the 
newcomers will be closely related to species 
already present, the data showed. For ex- 
ample, the collared flycatcher (Ficedula al- 
bicollis) will move farther into northeastern 
Europe, joining its similar-size cousin, the 
pied flycatcher (F. hypoleuca).

Taken together, the papers forecast a 
more uniform avian world, a sad prospect 
for birdwatchers as well as a blow to eco- 
systems. Some evidence suggests homo- 
genization is happening in amphibians 
and mammals, too. In the future, “It’s the 
more unique species that will be at a
disadvantage,” Jarzyna says.
INFECTIOUS DISEASES


Antibiotic after sex could help curb infections 


Doxycycline reduces risk of sexually transmitted diseases—but will it fuel drug resistance? 


By Jon Cohen 


A new study has shown that taking a
single dose of a widely used, cheap
antibiotic within 3 days after condomless
sex can help prevent chlamydia,
syphilis, and gonorrhea, three sexually
transmitted infections (STIs) that have
soared in the United States, Europe, and else- 
where over the past 2 decades. 

The study, mainly in men who have sex 
with men (MSM) in San Francisco and Se- 
attle, was halted in May after an indepen- 
dent data monitoring board found that 
the strategy, known as doxycycline post- 
exposure prophylaxis (doxyPEP), reduced the 
risk of chlamydia and gonorrhea by more 
than 60%—a result so convincing there was 
no need to continue the study. 
DoxyPEP also appeared to protect 
against syphilis, but too few cases 
occurred during the trial to reach 
statistical significance. The data 
were scheduled to be presented 
this week at the 24th International 
AIDS Conference in Montreal. 

“This is very encouraging,” says 
Carlos del Rio, an HIV/AIDS cli- 
nician and researcher at Emory 
University School of Medicine. 
But there are worries the regimen 
could trigger antibiotic resistance 
in the three bacteria that cause 
these diseases, and scientists are 
divided about whether the data 
warrant doxyPEP’s introduction 
now. “It is still a controversial 
topic,” says Jean-Michel Molina of 
the University of Paris Cité, who led a simi- 
lar, smaller study in France a few years ago. 
“I don’t think we know enough to recommend
the strategy yet.”

Doxycycline, a relative of the antibiotic 
tetracycline, has been around for more than 
45 years and is commonly used to treat and 
prevent acne and Lyme disease. It also acts 
on parasites and is widely prescribed to pre- 
vent malaria in travelers. The drug typically 
has few side effects beyond stomach upset 
and increased sensitivity to sunlight. 

The idea for the new doxycycline study 
comes from a small trial published in 2015. 
Fifteen MSM, who were at high risk of STIs 
because of their sexual behavior and already 
took medications for their HIV infections, 
added the antibiotic to their daily pill regimen.
When compared at 48 weeks with 15
similar MSM not given the antibiotic, the men
who took doxycycline this way—a form of
pre-exposure prophylaxis (PrEP)—had significantly
fewer infections with all three bacterial
STIs. “That was promising—as a pilot
study,” says Jeffrey Klausner, an infectious
disease specialist at the University of South- 
ern California who co-led the trial. 

Molina’s study in France, published in 
2018, was based on the same idea, but with 
a slight difference: Doxycycline was used as 
PEP, not PrEP. Participants took the drug 
once, within 72 hours after having condom- 
less sex. The team found this reduced the risk 
of acquiring chlamydia or syphilis by 70% 
and 73%, respectively, but there was no effect 
on gonorrhea rates. 


Clinician Annie Luetkemeyer examines a study participant in San Francisco. 


The new study enrolled 544 participants— 
mostly MSM, but also some transgender 
women and gender diverse people—deemed 
at high risk for STIs. Two-thirds of them 
were asked to follow a protocol similar to the 
one in the French study. The remaining par- 
ticipants received standard STI testing and 
treatment. Everyone knew what group they 
were in. “We really wanted to do a study in a
fairly real-world setting,” says one of the principal
investigators, Annie Luetkemeyer, an
infectious disease clinician at the University 
of California, San Francisco. 

Leandro Mena, who heads the division 
of the U.S. Centers for Disease Control and 
Prevention (CDC) that oversees STIs, says 
he is “very excited” about the findings. “We 
want to have the opportunity to fully review
the data,” Mena says, but CDC may soon issue
preliminary guidance about the regimen for 
high-risk populations. 

Researchers differ on whether doxyPEP’s 
benefits outweigh the risk of driving anti- 
biotic resistance. Neisseria gonorrhoeae, un- 
like the bacteria that cause the other two STIs, 
readily develops resistance. DoxyPEP will 
make that worse, Molina predicts—“That’s a 
no-brainer.” What will happen with chlamydia
and syphilis, which have not yet evolved to 
resist antibiotics, “we don’t know—we hope,” 
he says. 

The U.S. team is still analyzing antibiotic 
resistance data from its study. But even if 
doxyPEP increases resistance in N. gonor- 
rhoeae, it won’t have a practical impact be- 
cause a different antibiotic class is now used 
to treat gonorrhea, notes Connie 
Celum, an epidemiologist at the 
University of Washington, Seattle, 
and one of the principal investiga- 
tors. Besides, she says, doxyPEP 
would only be recommended for 
people at high risk of STIs, a rela- 
tively small group. 

Celum is also involved in a doxy- 
PEP study in Kenyan women who 
take PrEP for HIV and have dispro- 
portionately high rates of STIs. “We 
need interventions now, and this is 
a safe, inexpensive, well-tolerated 
drug that really seems to have few 
downsides,” she says. And treating 
those at highest risk may help re- 
duce the spread of the STIs in the 
wider population, Celum adds. 

The strategy may have other 
downsides, however, notes Christopher 
Fairley, who heads the Melbourne Sexual 
Health Centre at Monash University, Clayton. 
Antibiotics can disrupt the bacterial micro- 
biome in the gut, for example, and resistance 
genes can jump between bacteria. Many gon- 
orrhea and chlamydia infections cause no 
symptoms, resolve without treatment, and 
are “of no great significance,” he adds. (Un- 
treated syphilis, in contrast, can damage the 
eyes, ears, and brain, and can lead to miscar- 
riage and stillbirth in pregnant people.) 

But Klausner says the benefits outweigh 
the risks. Some physicians who treat MSM 
already prescribe doxycycline for STI protec- 
tion, he notes. “Patients in the community 
know about it,” he says. “The time is now to
come up with recommendations.” 
By Richard Stone


Before embarking on his Ph.D.,
Ralph McNutt had never been east 
of the Mississippi River. But soon 
after the young Texan arrived at 
the Massachusetts Institute of 
Technology (MIT) in the fall of 
1975, he found himself on a voyage 
to the edge of the Solar System— 
and beyond. Casting around for a 
research assistantship, he ended up in the 
office of plasma physicist Herbert Bridge, 
a towering figure in space science who had 
overseen the cloak-and-dagger effort to 
dismantle and ship Harvard University’s 
cyclotron to New Mexico for the Manhat- 
tan Project during World 
War II. Bridge evidently 
saw a familiar spark in 
McNutt and invited him to 
work on a plasma detector 
for Voyager, the epic mis- 
sion to the outer planets 
that began in 1977. “I said, 
‘Where do I sign up before 
you change your mind?’”

Now, this veteran of Voy- 
ager, one of NASA's greatest 
scientific triumphs, wants 
to wheel his own passion 
project onto the launchpad. 
McNutt and colleagues at 
the Johns Hopkins Univer- 
sity Applied Physics Labo- 
ratory (APL) have laid out 
a concept for Interstellar 
Probe (IP), a $3.1 billion 
mission to pick up a sci- 
entific gauntlet that the two Voyager probes 
threw down a decade ago after leaving the 
heliosphere, the Sun’s zone of influence. Few 
expected the spacecraft to survive that long, 
yet their beguiling observations, still trick- 
ling in, have upended many beliefs about the 
Solar System’s outer limits. “A lot of our pre- 
conceived notions didn’t work out too well,” 
McNutt says. 

The Voyager data are so mystifying that 
some prominent researchers assert the 
probes haven’t made it to interstellar space 
yet, perhaps because the bounds of the 
heliosphere stretch farther than generally 
thought. Gazing out from Earth’s perch won’t 
settle the matter. “The only way to see what 
our fishbowl looks like is to be outside looking
in,” McNutt says. “We need to get modern
instruments out there,” adds Lennard Fisk, a 
space physicist at the University of Michigan 
(UM), Ann Arbor. “In that sense, Interstellar 
Probe would be revolutionary.” 

Now, McNutt needs to convince a jury of 
his peers. His team has delivered a concept 
study of IP to the decadal survey of solar
and space physics, a community exercise
led by the National Academies of Sciences,
Engineering, and Medicine that will set the
field’s priorities for the next 10 years. The
panel is set to begin deliberating next month
and deliver its verdict in 2024. A thumbs-up
for IP would go a long way toward securing
NASA support for a probe that would, ideally,
lift off in 2036. The timing would allow
it to rendezvous with Jupiter and its potent
gravity, which would sling the probe toward
interstellar space. It would arrive about
16 years later, in half the time it took Voyager.

Can Ralph McNutt rally
support for a mission he’s mused
over for half a century?

Chinese scientists are designing a simi- 
lar mission, called Interstellar Express, 
that could launch around 
the same time. Buckle up, 
enthuses Jim Bell, a plan- 
etary scientist at Arizona 
State University, Tempe, 
and past president of the 
Planetary Society. “It’s a 
space race to the edge of 
the Solar System!” 

One challenge for 
McNutt and his colleagues 
is selling a mission ex- 
pected to last at least 
50 years, requiring three or 
more generations of scien- 
tists. More daunting may be 
winning hearts and minds 
in space physics, which is 
dominated by experts on 
space weather—the solar 
flares and coronal mass 
ejections that can wreak 
havoc on satellites and power grids. “People 
are overly scared that one big project will 
suck out all the funding for the rest of the sci- 
ence we want to do,” says APL space physicist 
Pontus Brandt, chief scientist on the IP mis- 
sion concept study. But Merav Opher, an 
astrophysicist at Boston University, says 
broadening the field’s boundaries is impor- 
tant. “It’s myopic if we continue to fund just 
space weather.” 

“Voyager on steroids,” as McNutt calls IP, 
may stumble at this first hurdle. “It’s a long 
shot,” says Bell, who doesn’t have a stake in
the project. But IP has a powerful champion
in McNutt, says Opher, who calls him “a fantastic
mover and shaker.” According to Bell,
McNutt’s superb mentoring abilities will also 
be critical. “You really do have to think be- 
yond your own lifetime,” he says. 


INTERSTELLAR SPACE has been a lifelong 
pursuit for McNutt, who says he was “an 
introverted and nerdy kid” with a pas- 
sion for science fiction. One work that 
left a deep impression was Robert Heinlein’s
Time for the Stars, whose premise
was the twin paradox, an early 20th
century thought experiment that sought 
to explain a mind-bending aspect of Al- 
bert Einstein’s special theory of relativ- 
ity. In the novel, a telepathic teenager 
joins an expedition to search for habit- 
able planets around other stars; he spends 
4 years on a spaceship that can travel at 
close to light-speed. He returns home to 
find his earthbound identical twin had 
aged 71 years. That premise inspired Mc- 
Nutt, age 16, to concoct an interstellar mis- 
sion as his project for the 1970 Fort Worth, 
Texas, science fair. He laid out the physics 
hurdles of such an epic voyage and even 
crafted a prototype spacecraft from poster 
board, balsa wood, and Elmer’s glue. 

In high school, McNutt struggled to satisfy 
his thirst for science. School administrators 
“were more interested in preventing kids 
from dropping out,” but he and several fel- 
low students successfully petitioned 
for a physics course. A few years later, 
McNutt got a chance to meet the “father 
of space travel”: Wernher von Braun, a 
former Nazi rocket scientist who moved 
to the United States after World War 
II and became the chief architect of 
NASA’s Moon program. Von Braun was
giving a talk at Texas Christian Univer- 
sity and McNutt was tapped for a stu- 
dent panel that would pose questions. 
He asked von Braun whether NASA 
had plans to put humans on Mars by, 
say, 1990. That wasn’t in the cards, von 
Braun responded dryly. Instead, he said, 
the space agency would focus on robotic 
probes. “I was really irritated,” McNutt
says. “I was thinking something like, ‘What
the hell’s wrong with you?’”

McNutt came away from the encounter 
with von Braun’s autograph—now in his base- 
ment, along with the science fair model—and 
a fierce determination to become a space 
scientist. He had a knack for math—“I used 
to do slide rule speed competitions,” he
confesses—and majored in physics at Texas 
A&M University, College Station. At MIT, as a 
junior member of the Voyager team, he got to 
go to Cape Canaveral for Voyager 1’s launch in 
1977, and he vividly remembers a visit 2 years 
later to mission control at the Jet Propulsion 
Laboratory (JPL). TV monitors in JPL’s caf- 
eteria were showing the first images of Io, Ju- 
piter’s flamboyantly colorful volcanic moon. 
“It looked like a rotting orange or a pizza pie.
I thought, ‘Oh my God, it’s so beautiful.’”

Voyager's revelations about the enigmatic 
outer planets kept coming. And the daunt- 
less probes kept going. By the early 2000s, 
it seemed plausible that one or both would 
reach the heliopause—the boundary between 
the heliosphere and interstellar space, where 
the Sun’s blast of charged particles, the solar
wind, peters out. Tempering that thrilling 
prospect was the fact that the probes were 
engineered primarily to interrogate Jupiter's 
powerful magnetosphere, not the far weaker 
fields and particles of the interstellar me- 
dium. “By today’s standards, the information 
you can get from the Voyager spacecraft is 
primitive,” Bell says. Still, McNutt adds, “The
fact that we could get something was a whole 
lot better than nothing.” 


A BIG SURPRISE came in 2007, when Voy- 
ager 2, diving below the ecliptic plane in 
which the planets orbit, crossed the ter- 
mination shock: where the solar wind first 
starts to falter as it is buffeted by the inter- 
stellar gas and dust the Solar System is bar- 
reling through. Voyager 1 had crossed the 
shock 3 years earlier, some 94 astronomical 
units (AU) from Earth. (One AU, the average
distance between Earth and the Sun, is
approximately 150 million kilometers.) But
its plasma detector had failed at Saturn in
1980, so it could not measure the slowing
of the solar wind. Models had predicted
the wind would decelerate from 1.2 million
kilometers per hour to about 300,000 kilometers
per hour. But Voyager 2 clocked a
windspeed of 540,000 kilometers per hour.
“Going through the termination shock, people
said, ‘WTF?’” Brandt says.

Interstellar Probe would fly twice as fast as Voyager.

Also mystifying was that Voyager 2 crossed 
the shock a full 10 AU closer to Earth than 
Voyager 1. After a Voyager team member 
broke the news at a conference in Switzerland, 
“Everybody was like, ‘What’s going on?’” says
APL’s Elena Provornikova, IP’s heliophysics 
lead, who was then at the Russian Academy 
of Sciences’s Space Research Institute in Mos- 
cow. “We immediately started talking about 
what could cause this asymmetry—what 
could be the physics behind it.” 

Space physicists later worked out that 
models had largely ignored interstellar mag- 
netic fields, which compress the heliosphere 
below the ecliptic, Provornikova says. The
models also assumed the solar wind is a
steady gale. But it fluctuates with the Sun’s
11-year cycle of magnetic activity, another
reason why the two probes reached the shock 
at different distances. 

To explain the weakness of the termina- 
tion shock, space physicists turned to find- 
ings from other planetary probes such as 
Cassini, the spacecraft that peeled away the 
mysteries of Saturn and its rings. One was a 
better understanding of “pickup” ions: neu- 
tral atoms, primarily hydrogen, from inter- 
stellar space that become ionized when they 
encounter the solar wind or the Sun’s ultra- 
violet radiation. “Voyager was not equipped 
to measure pickup ions,” Brandt says. “And
these are really central here.” Scientists in- 
ferred that pickup ions riding along with the 
solar wind would gain enough energy cross- 
ing the termination shock to explain why the 
wind did not slacken as much as predicted. 

After navigating that first boundary, the
Voyager probes entered the heliosheath,
the region where the diminished solar
wind continues to wither under a fusillade
of gas and dust as the Solar System
plows through space. Before Voyager’s
encounter, the heliosheath was viewed
as the heliosphere’s thin “skin.” But with
a stronger solar wind emerging from
a weak termination shock, the sheath
ought to be thicker. The solar wind
would run farther before sputtering to
a stop at the heliopause, where the hot,
wispy plasma of our heliosphere gives
way to the cold, dense plasma of interstellar
space.

Without a working plasma detector,
Voyager 1 was hard-pressed to confirm
that picture. But in early 2013, mission sci- 
entists, sifting data from other detectors, 
declared that the probe had in fact left the 
heliosphere months earlier, on 25 August 
2012—some 122 AU from Earth. A precipitous 
drop in higher energy solar wind ions and a 
concomitant rise in cosmic rays clinched the 
case, they said. Six years later, Voyager 2 hit 
the heliopause at nearly the same distance 
from the Sun, in a different phase of the solar 
cycle—suggesting that unlike the termina- 
tion shock, the heliopause is insensitive to 
solar variation. “That was just incredible,”
Provornikova says. 

Other data didn’t add up. The Sun’s mag- 
netic field, embedded in the solar wind, is 
twisted into a spiral by the Sun’s rotation. 
Crossing the heliopause, Voyager 1 should 
have observed a change in the direction of 
the magnetic field, as the solar wind’s twist- 
ing field gives way to differently oriented 
interstellar fields. “But it was basically 
the same damn direction from the Sun,” 
Brandt says. “All the people that know the 
theory behind it, they’re perplexed.” 
Fisk thinks it’s a sign the probes haven’t yet
reached interstellar space. In the 1 March is- 
sue of The Astrophysical Journal, he and UM 
colleague George Gloeckler propose Voyager 
1 and 2 are still in the heliosheath, where they
have encountered a unique plasma contain- 
ing two magnetic fields: not just the field 
embedded in the wind, but an additional 
one created by mobile ions that aren’t swept 
up into the wind. “The physics changes dra- 
matically when you account for that,” he says. 
Provornikova and others hold firm that the 
probes are in interstellar space, arguing that 
the solar wind’s magnetic field evidently dis- 
sipates over much greater distances than ear- 
lier models augured. “I don’t see a scenario 
where Voyager’s still inside the helio- 
sphere,” Opher says. 


NO MATTER WHO’S RIGHT, scientists 
find Voyager’s interstellar space oddi- 
ties irresistible. “Voyager didn’t give us 
the answers we're looking for, and we 
should capitalize on that,” Fisk says. McNutt
agrees, and in 2017 he assembled
a 45-person team of collaborators—in- 
cluding Fisk and Opher—to flesh out a 
mission concept. Scientists have con- 
templated an interstellar mission for 
50 years, since before Voyager, McNutt 
says, but “nobody had sat down and run 
the numbers and done the engineering.” 
The group released its 498-page report 
at the American Geophysical Union 
meeting in December 2021. 

The mission concept study deci- 
sively settles one engineering question: 
whether to launch the probe toward the 
Sun and use its enormous gravity as a 
slingshot, an idea called the Oberth ma- 
neuver. After huddling with experts at 
a thermal materials firm, the IP team 
worked out that the heat shield needed 
for the probe to pass so close to the Sun 
would add too much mass—and risk. 
“You wouldn’t get there any faster” compared 
with a conventional launch of a lighter probe, 
McNutt says—provided the launch is on a 
heavy lift rocket with a rare third and fourth 
stage. McNutt is eyeing NASA's Space Launch 
System, a mammoth rocket, bigger than the 
Saturn V, that could see its first launch this 
summer as NASA contemplates sending as- 
tronauts back to the Moon. And he has put 
out feelers to SpaceX and Blue Origin about a 
ride on one of the big launchers those private 
companies are developing. After the gravity 
boost from Jupiter, IP should peak at speeds 
of more than 7 AU per year, about twice as 
fast as the Voyager probes. 
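
As a rough check on those figures, the sketch below converts a cruise speed of 7 AU per year into kilometers per second and estimates how long a probe holding that speed would take to cover 122 AU, the distance at which Voyager 1 crossed the heliopause. The constant-speed assumption and the function names are illustrative simplifications, not part of the mission study; a real trajectory would accelerate in stages.

```python
# Back-of-the-envelope conversion. Assumes a constant cruise speed,
# which ignores launch and the Jupiter flyby (an illustrative simplification).
AU_KM = 150e6  # approximate kilometers per astronomical unit, as in the article
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def au_per_year_to_km_per_s(speed_au_per_year: float) -> float:
    """Convert a speed in AU per year to kilometers per second."""
    return speed_au_per_year * AU_KM / SECONDS_PER_YEAR


def years_to_reach(distance_au: float, speed_au_per_year: float) -> float:
    """Travel time in years at constant speed."""
    return distance_au / speed_au_per_year


if __name__ == "__main__":
    ip_speed = 7.0  # AU per year, the peak speed quoted for IP
    print(f"{au_per_year_to_km_per_s(ip_speed):.1f} km/s")
    print(f"{years_to_reach(122.0, ip_speed):.1f} years to 122 AU")
```

At 7 AU per year the conversion gives roughly 33 kilometers per second, and the constant-speed estimate is a little over 17 years to reach 122 AU.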

China’s Interstellar Express mission would 
send two probes in opposite directions: one 
toward the heliosphere’s nose, where mod- 
elers think it is squashed by the oncoming 
wind of particles in interstellar space, and
the other toward its tail. Observations from 
both missions “will give us a more compre- 
hensive picture of the heliosphere,” says 
Wang Chi, director-general of the National 
Space Science Center of the Chinese Acad- 
emy of Sciences. When his team proposed 
the mission in 2014, it envisioned a third 
probe that would launch on a path perpen- 
dicular to the ecliptic plane, using nuclear 
propulsion to escape the heliosphere. But the 
technical challenges are daunting, and that 
probe for now is on ice. “As the old Chinese 
saying goes, a journey of a thousand miles 
begins with a single step,” Wang says. “We
should make the two probes successful first.” 


Voyager 2 being packaged up prior to launch in 1977. 
It would cross the heliopause 41 years later. 


Although McNutt says the Chinese team 
“holds their cards close to their vest,” he, too,
views the missions as complementary. “The 
more the merrier!” he says. “To the extent 
you'll get different cuts through the helio- 
spheric structure and the nearby interstellar 
medium, you'll learn a lot about what’s going 
on out there.” 

APL’s concept report lays out a smorgas- 
bord of science IP could tackle, depending 
on the instruments it carries. High on the 
list is a suite of four detectors that would 
measure particles across a broad energy 
spectrum—from the chilliest plasma and 
feeblest pickup ions on up to cosmic rays 
hot enough to fry a DNA strand. “With 
Voyager we have giant energy gaps,” says 
APL physicist Alice Cocoros. Better detection
of pickup ions may be the most important
capability, Brandt says, with space
physicists just beginning to appreciate the 
unheralded role they play at the helio- 
sphere’s edges. 

A dust detector would cure another Voy- 
ager blind spot. “We know pretty much 
nothing about how much interstellar dust 
actually gets into the Solar System,” Provo- 
rnikova says, or how it interacts with the 
solar wind. On its outward journey IP 
could also map the cloud of dust in the 
outer reaches of the Solar System, left- 
over from its formation. The contours of 
this “zodiacal” dust could refine formation 
models, but they are largely unknown be- 
cause measurements have only been taken 
from inside the cloud, McNutt says. 

Legging it out beyond the zodia- 
cal cloud would offer another perk: an 
unobscured view of the extragalactic 
background light (EBL)—the sum of all 
radiation produced since the big bang. 
The New Horizons spacecraft, cruising 
out beyond Pluto, recently uncovered 
a mystery when it observed a patch of 
dark sky and recorded about twice as 
much visible light as the current cen- 
sus of galaxies can explain, the mission 
team reported in the 1 March issue of 
The Astrophysical Journal. Equipped 
with the right instruments, McNutt 
says, IP “could for the very first time de- 
termine the EBL’s absolute brightness” 
across all wavelengths. 

Once in interstellar space, IP could 
also follow in the tradition of other far- 
ranging probes and look back toward 
home. But instead of the pale blue dot of 
Earth, it would capture an image of the 
entire heliosphere. “You can solve this 
in one fell swoop,” McNutt says, using
a unique camera that images a swirling 
Van Gogh-esque nightscape of energetic 
neutral atoms (ENAs) generated in the 
heliosheath when solar wind ions collide 
with interstellar hydrogen atoms, neutraliz- 
ing the ions. High-energy ENAs—those above 
50 kiloelectron volts (keV)—are especially 
revealing. “Simulations show that once you 
get above 50 keV, something remarkable hap- 
pens—you start seeing images of the shape 
of the heliosphere,” Brandt says. But he says 
with a grin, “It will probably be the most ex- 
pensive picture in history.” 

NASA got a preview in 2008, when it put 
a spacecraft the size of a bus tire called In- 
terstellar Boundary Explorer (IBEX) in orbit 
around Earth. Its two ENA cameras captured 
the first all-sky map of ENAs in the outer 
heliosphere—and they revealed a stunner: a 
winding ribbon that is richer in ENAs than 
surrounding areas. “The great circle in the 
sky,” as Brandt calls it, may be a region just
beyond the heliopause where ions snared in a 
magnetic field spawn ENAs. IP’s ideal launch
in 2036 would shoot the probe right through 
the ribbon. 

Early IBEX data supported the traditional 
notion of a comet-shaped heliosphere, with a 
tail extending two to three times farther into 
space than the nose. But subsequent mea- 
surements from IBEX, Cassini, and Voyager 
point to a more rounded heliosphere, and 
recent modeling suggests it is concave on one
side, like a croissant. As an encore to IBEX, 
NASA in 2025 plans to launch the Interstellar 
Mapping and Acceleration Probe (IMAP), 
which would peer out at the heliopause from 
an orbital station between the Sun and Earth 
with much finer imaging resolution. “IMAP 
will bring a lot to the party,” McNutt says. But
IP, he says, will be able to provide the most
revealing ENA map of all once it exits the 
heliosphere and snaps that picture of high- 
energy ENAs lighting up the heliosheath. 


THE SCIENCE wouldn’t stop after the probe 
reaches interstellar space. Opher says IP 
would be a “game changer” in our under- 
standing of the interstellar clouds of gas and 
dust the Solar System hurtles through dur- 
ing the Sun’s 230-million-year orbit around 
the center of the Milky Way. Like oases in 
a desert, these clouds are likely remnants 
of stellar nurseries, rich wellsprings of hy- 
drogen that collapsed under gravity to form 
stars. Space physicists have put together a 
rudimentary cloud atlas of the local inter- 
stellar neighborhood. “It looks like a child’s 
sketch—but that’s all we’ve got,” Brandt says.
IP would directly sample gas, dust, and 
other properties of the Local Interstellar 
Cloud, the Solar System’s home for the past 
60,000 years. And by measuring the absorp- 
tion of starlight by dust and hydrogen atoms, 
it could probe the nearby G cloud, into which 
we will plunge in the next 2000 years—if the 
transition hasn’t already begun. “We have no 
clue what’s going to happen next,” Brandt
says. The denser and colder a cloud is, the 
more momentum it will sap from the solar 
wind. That could squash the Sun’s magnetic 
cocoon, to our biosphere’s detriment. 
Voyager found that 75% of the cosmic rays 
heading our way from interstellar space get 
filtered out in the heliosphere’s outer reaches. 
If the encounter with the next cloud squeezes 
the heliosphere all the way down to Earth’s 
orbit, life forms would be exposed to an in- 
tense radiation environment that would rid- 
dle DNA with mutations, Brandt says. 
There’s evidence of such an event around 
the time early hominids were just beginning 
to pick up stone tools, and Brandt muses on 
a possible connection. “Let that creep up 
your spine for a moment,” he says. In recent 
years, scientists have discovered iron-60 
isotopes in ocean crust samples dating from 
2 million to 3 million years ago (Science,
16 July 2021, p. 269). Iron-60 is not found 
naturally on Earth: It’s forged in the cores 
of large stars. So, either a nearby super- 
nova blasted the heliosphere with the iron 
dust, or the heliosphere drifted through a 
dense cloud laden with iron-60 from a pre- 
vious supernova. Either way, Brandt says, 
“The heliosphere was way in, and we had a 
full blast of galactic cosmic rays and inter- 
stellar matter for a long, long time.” To look 
for relics of other such events, IP could use 
plasma wave antennas to essentially take 
the temperature of nearby electrons. Hot 
regions might mark the blast paths of mate- 
rial from past supernovae. 


The IP team is thinking big in other
ways—even the possibility that the probe will
ultimately stray near another star and fall
into alien hands. Each Voyager probe carries
a golden record filled with music and
voices sampling Earth’s cultures. IP would
likely bear an updated digital version: a
thumb drive, perhaps, offering a flavor of
life on Earth for aliens cosseted in their own
heliospheres—as long as an extraterrestrial
IT department can figure out how to read it.


IF THE DECADAL SURVEY endorses IP and
NASA embraces it, the space agency would
need to persuade Congress that the Star
Trekkian emissary is worth the price tag—
and then decide which lab would honcho
it. Fresh off APL’s latest success—the
$1.5 billion Parker Solar Probe, which is
flying closer to the Sun than any mission
ever—McNutt’s team is eager to put their
sales hats back on. “We have a lot of missions
under our belt. Parker came in around
$100 million under budget,” Brandt says.

In the meantime, APL is assiduously
breeding the next generation of IP scientists.
Literally. “We counted 13 babies born
during the concept design study,” says
Cocoros, who has high hopes her son Luke,
who’s about to turn 2, will become enamored
with space. She’s expecting a daughter
in November. “I guess she makes 14!” During
the design study, Cocoros served as a
bridge between scientists and engineers as
they strove for a sweet spot: an instrument
payload that would meet key science objectives
without growing too fat for a conventional
launcher. “I loved being the glue in
the middle of the project,” she says.


Mission control at NASA's Jet Propulsion Laboratory at the time of Voyager 1’s encounter with Saturn in 1980.

Cocoros sees McNutt as a mentor. “I 
adore him. If you ask him a question, he 
tells you a story. A novel,” she says. She 
says his wealth of knowledge is mirrored 
in his APL office, which is cluttered with 
the memorabilia of a life devoted to space. 
“It’s like his brain. Piles and piles of VHS
tapes of past missions, giant binders of I 
don’t know what.” 

With future IP leaders waiting in the 
wings—and some still in diapers—McNutt 
hopes his push for the stars will at last get 
the go-ahead. “We don’t want to kick this 
can down the road farther than it already 
has been,” he says. For this Voyager veteran, 
after all, it’s a trip that’s been a half-century 
in the making. 
Seeing inside a body in motion


Adhesive ultrasound patches can provide medical imaging for patients on the go 


By Philip Tan and Nanshu Lu


Ultrasonic imaging is one of the most
powerful and commonplace medi- 
cal tools for noninvasive visualiza- 
tion of soft tissues inside the body. 
However, ultrasonography requires 
highly trained sonographers to po- 
sition and orient the transducer on the 
surface of the patient’s body, and image 
quality is highly dependent on the steadi- 
ness of the operator’s hand. Because of this, 
ultrasonic imaging has been largely limited 
to short and static sessions not only for lo- 
gistical reasons but also because of the very 
real threat of musculoskeletal injuries for 
sonographers from repetitive motions associated
with transducer manipulation (1).
Coincidentally, there has also been a sonographer
shortage worldwide over the past
decade, and the demanding and specialized 
training required for certifying sonogra- 
phers does not help alleviate this problem 
(2). On page 517 of this issue, Wang et al. (3) 
introduce a bioadhesive ultrasound (BAUS) 
patch with the potential to overcome many 
of these outstanding challenges. 

Over the past few years, several wearable 
ultrasound patches have been developed 
(4-7). Although there have been improve- 
ments in transducer thinness and softness, 
none satisfy all the patch-tissue-interface 
requirements for high-quality ultrasonic 
imaging. To be ready for real-world appli- 
cations, the patch must have stable adhe- 
sion to the body, the acoustic energy from 
the transducer must be transmitted effi- 
ciently into the body, and the device must 
not cause skin irritation. In a standard ul- 
trasound exam, liquid gel is the de facto 
couplant, that is, a material that improves 
acoustic transmission at the transducer- 
tissue interface. However, conventional 
ultrasonic gel is only suitable for short imaging
sessions because it dries out or drips
down over time. Conversely, elastomer (4, 5,
8) and hydrogel (8) couplants have been ex- 
plored but tend to have less-than-desirable 
transmissivity or usable time, respectively. 

To overcome these weaknesses, Wang et 
al. combined the strengths of both elasto- 
mers and hydrogels by encapsulating a soft 
hydrogel in a thin elastomer membrane 
and coating it with a bioadhesive material. 
This hydrogel-elastomer hybrid, which 
they named the BAUS couplant, enables 
more than 48 hours of adhesion to the skin 
and sustained transmissivity. The soft and 
adhesive BAUS couplant also alleviates the 
stiffness of a rigid transducer array, mak- 
ing it more comfortable to wear. The rigid 
transducer array allows for a high trans- 
ducer density, with 400 transducers per 
square centimeter. 

The BAUS patch opens exciting oppor- 
tunities for increasing accessibility to ul- 
trasound exams. Although, at present, the 
patch still needs to be attached to a nonport- 
able data acquisition system for processing 
the images, it could provide immediate re- 
lief for sonographers, for example, during 
sessions when a patient is undergoing anesthesia
and constant monitoring of cerebral
blood flow is required (9). In this scenario, 
the BAUS patch can simply be stuck to a pa- 
tient’s temple and provide stable blood flow 
imaging. The patch also opens opportuni- 
ties for long-term, dynamic, high-resolution 
imaging of internal organs. For example, 
discreet monitoring of bladder volume 
could help people who suffer from urinary 
dysfunction. The inability to sense bladder 
fullness is a cause of urinary incontinence, 
leading to physiological and psychological 
injury, and a portable ultrasonic device for 
measuring bladder fullness can provide pri- 
vate alerts for these individuals to relieve 
themselves (10). For managing cardiovascu- 
lar disease, stress echocardiograms provide 
valuable insights into cardiac functions, 
including valvular regurgitation, valve gra- 
dients, chamber volume, and myocardial 
strain tracking (11). However, the difficulty
of maintaining the transducer probe during 
motion puts substantial logistical limits on 
these exams. 

Thanks to excellent coupling to the tis- 
sue, the BAUS patch can capture high-qual- 
ity images during a wide range of motions 
(see the figure). Muscular imaging enables 
quantification of muscle contraction, which 
enhances athletic training and rehabilita- 
tion of neuromuscular disorders (12). Also, 
ultrasonic imaging of the lungs has been a 
powerful tool for screening for respiratory 
diseases such as COVID-19 (13). A “set- 
and-forget” ultrasonic patch would allow 
for continuous monitoring of contagious
patients with minimal exposure risk to the 
medical staff. 

However, the BAUS patch has several limitations
that will require future
improvement. Wang et al. introduced
a plug-and-play snap connector system that 
could link to several dozen transducers. 
However, three-dimensional ultrasonic im- 
aging will require thousands of transducers 
to provide sufficient resolution. Controlling 
thousands of transducers requires extensive 
circuitry and hardware, which will limit 
maneuverability and mobility. Forming in- 
terconnects between transducers and the 
circuitry becomes more challenging with 
more transducers involved. Research into 
“ultrasound on a chip” will offer the much- 
desired integration of transducers with 
requisite analog and digital processing cir- 
cuitry. The BAUS couplant has the poten- 
tial to improve the long-term wearability of 
these systems, despite differing transducer 
array fabrication approaches. 

To make ultrasound completely user- 
friendly for the average patient, artificial 
intelligence technology may also be needed 
to assist with placement and beam steering. 
Researchers have demonstrated an interac- 
tive application using a deep-learning model 
that guides the operator to manipulate the 
transducer to produce the best image of a 
target anatomy (14). Moving forward, automated
image analysis and interpretation
will be just as important as image acquisi- 
tion. The road to providing universal acces- 
sibility for mobile ultrasonic imaging will 
require concerted efforts in diverse fields of 
engineering, science, and medicine.
Targeting
brain cancer 


A new drug is designed 
to hit an old target 
with increased precision 


By Roger Reddel and Adel Aref 


Glioblastoma (GBM) is the most
common form of malignant tumor
originating in the brain and is one
of the most lethal cancers, with
only 1 in 20 patients surviving for 5
years after diagnosis (1). The relentless
growth of cancers such as GBMs results
from DNA mutations and epigenetic
changes, but these also create opportuni- 
ties for targeted therapies. On page 502 
of this issue, Lin et al. (2) describe a new 
approach to killing GBM cells that targets 
a well-known molecular change in GBMs: 
the decreased expression of the DNA repair 
enzyme O⁶-methylguanine methyltransferase
(MGMT). The authors develop a new
agent that exploits this reduced DNA re- 
pair capacity and propose that their ap- 
proach could be generalized to other can- 
cers with defective DNA repair. 

The expression of the MGMT gene is 
repressed by DNA methylation of its pro- 
moter. This occurs in ~50% of GBMs and 
>70% of lower-grade gliomas (grades II 
and III), and less frequently in other can- 
cers (3). How this change contributes to 
oncogenesis is unclear, but MGMT inter- 
acts with the p21 and proliferating cell 
nuclear antigen (PCNA) cell cycle proteins 
(4), so lack of MGMT protein presumably 
contributes to unrestrained cell prolifera- 
tion. What is much clearer is that MGMT 
provides a critical function in maintain- 
ing the integrity of DNA, with its primary 
function being to repair O⁶-methylguanine
DNA adducts, which cause base-pairing 
with thymine rather than cytosine and 
thus can introduce mutations during DNA 
replication. MGMT repairs these adducts 
through a “suicide” mechanism in which it 
transfers the methyl group from damaged 
DNA to the sulfur group in its internal ac- 
ceptor site, inactivating itself (3). 

Inducing toxic DNA lesions in glioblastoma

The drug KL-50 was designed to deposit an O⁶-2-fluoroethyl (O⁶FEt) lesion on guanine bases that can be
rapidly removed by the O⁶-methylguanine methyltransferase (MGMT) enzyme. If not removed because
of MGMT deficiency in a cancer cell, the lesion is slowly converted to a highly deleterious interstrand cross-link,
leading to cancer cell death by a mechanism independent of the mismatch repair pathway.

One of the few therapeutic interventions
that has well-documented benefit in GBM
is the combination of radiotherapy and an
alkylating drug, temozolomide (TMZ) (5). 
TMZ is a prodrug that converts to an al- 
kylating agent under physiological condi- 
tions and causes various types of lesions in 
DNA, most of which are readily repaired by 
the base excision repair pathway. A small 
percentage of the lesions, however, are O⁶-methylguanine
adducts, so GBMs with low
MGMT expression accumulate these TMZ- 
induced DNA adducts. This leads to cell 
death through a mechanism that involves 
the mismatch repair (MMR) pathway (6) 
and is an example of a synthetic lethal in- 
teraction between a drug and a genetic or 
epigenetic change (7). MGMT deficiency is 
therefore a biomarker for response to TMZ 
in GBM and grade II and III gliomas. By 
contrast, in other types of tumors, MGMT 
deficiency is either not predictive of TMZ 
response, or its predictive capability has 
not yet been established (8). In most 
MGMT-deficient GBM patients, the initial 
response to TMZ is eventually followed by 
tumor recurrence coincident with muta- 
tions in genes that encode MMR proteins 
and the development of a hypermutator 
phenotype. 

Lin et al. describe a new approach to 
exploiting MGMT deficiency in GBM cells 
(see the figure). They designed agents that 
create a primary DNA lesion that can be re- 
versed by MGMT, but if this does not occur, 
the lesion evolves into a toxic DNA inter- 
strand cross-link (ICL). ICL formation oc- 
curs at a rate that is slow enough for most 
or all of the primary lesions to be repaired 
in normal cells for which the drugs should 
therefore have low toxicity. ICLs cause cell 
death in a manner that cannot be evaded 
by MMR mutations, so this design should 
also decrease the acquisition of resistance. 
One of these agents, KL-50, was shown 
to cause maximal ICLs 8 to 24 hours af- 
ter treatment and to retain selectivity for 
MGMT-deficient cancer cells as well as 
killing MMR-deficient cells for which TMZ 
was ineffective, confirming that the MMR 
pathway is not involved. Moreover, KL-50 
appeared to have an acceptable toxicity 
profile in mice xenografted with human 
GBM cells. There will be many steps re- 
quired to take this potential therapeutic 
agent from preclinical studies to human 
trials. It will be of particular interest to 
determine whether changes in DNA meth- 
ylation of the MGMT promoter (9) emerge 
as a mechanism of resistance, and whether 
any of the mechanisms other than MMR 
mutations that result in TMZ resistance 
(10) can also result in resistance to KL-50. 

Mutations or epigenetic changes that 
result in impairment of one or more DNA 
repair pathways occur in many different
types of cancer, especially as a result of
treatment (11), and mutation of various
DNA repair genes is the most common 
known cause of hereditary cancer. Lin et 
al. propose that their approach could be 
used to exploit DNA repair deficits in can- 
cer cells other than MGMT deficiency for 
therapeutic purposes. The principle would 
be the same: Modify the agent so that it 
produces an initial lesion that is rapidly 
repaired in DNA repair-proficient normal 
cells—but not when the target DNA repair 
pathway is deficient—and a secondary 
toxic lesion that develops slowly. 
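
One simple way to see why the timing matters is to treat repair and cross-link formation as two competing first-order processes acting on the primary lesion. The sketch below is an illustrative toy model, not the analysis in Lin et al.; the function names are made up, and the rate constants are hypothetical placeholders chosen only to contrast a repair-proficient cell with a repair-deficient one.

```python
# Toy model: a primary lesion is either repaired (rate k_repair) or
# converted to a toxic secondary lesion (rate k_convert). Both rates are
# hypothetical, in units of 1/hour, and are not taken from Lin et al.
import math


def fraction_converted(k_repair: float, k_convert: float) -> float:
    """Fraction of primary lesions that eventually become toxic lesions."""
    total = k_repair + k_convert
    if total <= 0:
        raise ValueError("at least one rate must be positive")
    return k_convert / total


def primary_remaining(k_repair: float, k_convert: float, hours: float) -> float:
    """Fraction of primary lesions still unresolved after a given time."""
    return math.exp(-(k_repair + k_convert) * hours)


if __name__ == "__main__":
    k_convert = 0.1            # slow conversion (hypothetical)
    proficient_repair = 10.0   # fast repair in normal cells (hypothetical)
    deficient_repair = 0.0     # no repair in the deficient tumor cell

    print(fraction_converted(proficient_repair, k_convert))
    print(fraction_converted(deficient_repair, k_convert))
```

With these placeholder rates, about 1% of primary lesions in the repair-proficient cell go on to become toxic lesions, whereas all of them do in the repair-deficient cell.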

While Lin et al. are improving the drugs
used to hit a well-established target— 
MGMT deficiency—the search continues
for new treatment targets in GBM. Despite
rapidly increasing knowledge about the 
biology of GBM, profiling of the GBM genomic
landscape, and identification of actionable
targets (12, 13), clinical outcomes
have not improved substantially in the past 
decade. A network meta-analysis of 40 ran- 
domized controlled clinical trials, which 
included patients with recurrent GBM 
treated with various regimens (including 
single agents and combinations of tyro- 
sine kinase inhibitors, chemotherapy, im- 
munotherapy, and antiangiogenic agents), 
showed no significant difference in overall 
survival between the different regimens 
(with overall survival ranging from 3 to 18 
months) (14).

The reasons that GBM is so difficult to 
treat are thought to include the extent to 
which tumor cells infiltrate the surround- 
ing tissue, making surgical cure largely 
impossible; an intrinsically high level of 
resistance to radiotherapy and chemo- 
therapy; an immunosuppressive tumor 
environment; and the extensive hetero- 
geneity among GBMs in different pa- 
tients and within individual tumors (15). 
Improvements in outcome are likely to 
come from increased understanding of the 
tumor biology. The history of the develop- 
ment of successful cancer treatments also 
suggests that outcomes will be improved 
by the use of treatment combinations. In 
this context, combining a potentially im- 
proved approach to targeting MGMT defi- 
ciency with other therapies such as radio- 
therapy, another targeted therapy, virolytic 
agent, and/or immunotherapy could be a 
promising step forward. 
MOLECULAR BIOLOGY


Molecular recorders to track cellular events 


DNA tapes could be used to record dynamic molecular and cellular events in animals 


By Nanami Masuyama, Naoki Konno,
Nozomu Yachie


Starting from a fertilized egg, cells proliferate,
pass their genomic information
to their progeny, and dynamically
change their functions to form diverse 
tissue structures. Throughout develop- 
ment, intracellular and environmental 
cues trigger patterns of gene expression that 
govern cell state transitions and produce fur- 
ther cellular and environmental cues, leading 
cells to self-organize into functional clusters 
in spatially distinct areas. How can these pro- 
cesses be investigated? High-resolution mo- 
lecular snapshots of cells can be obtained by 
various omics technologies, but these require 
sample destruction, which precludes analyses 
over time. Live-cell imaging with fluorescent 
probes can analyze time-course dynamics but 
is limited to a small number of molecules or 
cells. These challenges could be overcome by 
DNA event recording, where molecular and 
cellular information is progressively stored 
in synthetic “DNA tapes” in cells within a 
multicellular organism, and the stored infor- 
mation is accessed by high-throughput DNA 
sequencing. However, there are technical 
limitations that need to be overcome. 

One of the DNA event recording modali- 
ties, dynamic cell lineage recording using 
CRISPR-Cas9 genome editing, has acceler- 
ated developmental cell lineage tracing. In 
this system, chromosome-embedded synthetic
DNA barcode arrays are continuously
and randomly mutated by Cas9, which is 
recruited by the expression of many barcode- 
targeting guide RNAs (gRNAs); these mu- 
tated arrays are progressively inherited by 
daughter cells (1-3) (see the figure). Similar
to an evolutionary tree reconstruction from 
genomic sequences of diverse species, the 
cell lineage of an animal initiated from a 
fertilized egg can be reconstructed accord- 
ing to the mutation patterns of the barcode 
sequences in descendant cells at the time of 
observation using high-throughput DNA se- 
quencing. Such lineage tracing systems have 
also been linked to single-cell RNA sequencing
(scRNA-seq), where individual cell types
identified by scRNA-seq can be linked to 
their historical cell lineage information (3). 
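
As a rough illustration of how a lineage can be inferred from barcode mutation patterns, the following Python sketch greedily groups the cells that share the most edits. The barcode encoding (tuples of integers, with 0 for an unedited site), the cell names, and the greedy merge rule are all assumptions made for illustration; published lineage tracing studies rely on dedicated phylogenetic tools rather than this simplified procedure.

```python
from itertools import combinations

UNEDITED = 0  # assumed encoding: 0 marks a barcode site that was never edited


def shared_edits(a, b):
    """Count sites where both barcodes carry the same (non-zero) edit."""
    return sum(1 for x, y in zip(a, b) if x != UNEDITED and x == y)


def inferred_ancestor(a, b):
    """Keep only the states on which both descendants agree."""
    return tuple(x if x == y else UNEDITED for x, y in zip(a, b))


def reconstruct_lineage(cells):
    """Greedy agglomerative reconstruction; returns a nested tuple of cell names."""
    clusters = [(name, barcode) for name, barcode in sorted(cells.items())]
    while len(clusters) > 1:
        # Merge the pair of clusters that shares the most edits.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: shared_edits(clusters[p[0]][1], clusters[p[1]][1]),
        )
        (tree_i, bc_i), (tree_j, bc_j) = clusters[i], clusters[j]
        merged = ((tree_i, tree_j), inferred_ancestor(bc_i, bc_j))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters[0][0]


# Hypothetical four-site barcodes observed in four descendant cells.
example_cells = {
    "A": (1, 2, 0, 0),
    "B": (1, 2, 3, 0),
    "C": (1, 0, 0, 4),
    "D": (1, 0, 5, 4),
}
print(reconstruct_lineage(example_cells))  # (('A', 'B'), ('C', 'D'))
```

Edits shared by many cells are interpreted as early events, and edits private to a few cells as late events, which is the same logic that underlies tree reconstruction from species genomes.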

Today, new technologies are rapidly re- 
vising perspectives on developmental biol- 
ogy, such as the origins of the hypoblast and 
embryo-restricted epiblast (4), blood cell dif- 
ferentiation, and the deciphering of cell pro- 
liferation dynamics of metastatic cancers (5, 
6). Furthermore, ideas to develop other cell- 
embedded circuits that can convert environ- 
mental and cellular signals to genome-editing 
activities have emerged (2). Combined with 
a cell lineage recorder, such systems would 
enable the computational reconstruction and 
mapping of historical cellular events and cell 
state changes onto a cell lineage. These ideas 
could lead to the characterization of high- 
resolution cell lineages, cell state changes, 
and cell communications during whole mam- 
malian body development, homeostasis, and 
malfunction. This would deepen knowledge 
about developmental and cancer biology that 
could be translated into therapeutic strate- 
gies. However, DNA event recording has yet 
to trigger the next gold rush in biology be- 
cause there are some missing components. 
The current molecular event recording meth- 
ods need to overcome four substantial limita- 
tions in information recording capacity, avail- 
able molecular sensors to capture diverse 
biological events, the number of cells that can 
be analyzed, and computational capacity to 
reconstruct high-content biological history. 

Information recording capacity could be 
increased by extending the total length of the 
DNA barcodes. For example, several lineage 
tracing methods have used multiple short 
DNA barcode units per cell for the random 
accumulation of CRISPR-induced mutations. 
To read out the combinatorial mutation pat- 
terns in these barcodes in distal genomic loci, 
constitutively active promoters are intro- 
duced to allow transcription, and the muta- 
tion information is captured by scRNA-seq. 
The current methods use either a few copies 
of long gRNA-targeting barcode arrays (up to 
10 barcodes per array, whose size is restricted 
by short-read DNA sequencing methods) or 
many copies of short barcode arrays with 
single gRNA-targeting units. There seems 
to be no barrier to combining these two ap- 
proaches to achieve a high-content event 
recording system (i.e., encoding many long 
barcode arrays per cell). 
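
The gain from combining the two designs can be estimated with simple arithmetic. The Python sketch below computes an upper bound on recording capacity under the assumption that every barcode site is edited independently and can end up in a fixed number of distinguishable states. Only the figure of 10 barcodes per array comes from the text; the copy numbers and the four states per site are hypothetical values chosen for illustration.

```python
import math


def recording_capacity_bits(arrays_per_cell, sites_per_array, outcomes_per_site):
    """Upper bound in bits: log2(outcomes ** total_sites), assuming independent sites."""
    total_sites = arrays_per_cell * sites_per_array
    return total_sites * math.log2(outcomes_per_site)


# Hypothetical designs; four distinguishable states per site is an assumption.
few_long_arrays = recording_capacity_bits(3, 10, 4)    # a few copies of long arrays
many_short_arrays = recording_capacity_bits(30, 1, 4)  # many copies of single-unit arrays
combined = recording_capacity_bits(30, 10, 4)          # many copies of long arrays

print(few_long_arrays, many_short_arrays, combined)  # 60.0 60.0 600.0
```

Under these assumptions, the capacity scales with the product of array copy number and array length, which is why combining the two approaches is attractive.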


Altering the type of information-writing 
enzyme would also increase the information 
recording capacity. Most of the current cell 
lineage tracing systems use wild-type Cas9, 
which induces double-stranded DNA breaks 
that are toxic to cells and can also confer large 
deletions, especially when multiple breaks 
are introduced simultaneously. Additionally, 
these large deletions can overwrite acquired 
mutations and progressively collapse the bar- 
codes. These limitations could be alleviated 
by using a CRISPR base editor (BE), fusing 
a catalytically inactive Cas9 to a nucleoside 
deaminase that introduces single-nucleotide 
substitutions at the gRNA target sequence 
with minimal double-stranded DNA breaks. 
Most BEs confer unidirectional substitutions 
(e.g., C>T or A>G) (7). This could be a downside,
given that the lineage recording ability
crumbles when all substrate C or A bases in 
the DNA barcodes are converted and thus de- 
pleted. However, the recent development of 
dual-function BEs has enabled the simulta- 
neous introduction of C>T and A>G substi- 
tutions. Using such a dual-function BE that 
can confer reversible C:G↔T:A mutations
will enable more-accurate lineage tracing 
than wild-type Cas9 or single-function BEs. 
In evolutionary biology, phylogenetic trees 
have been efficiently reconstructed from re- 
vertant mutations. BEs have also been dem- 
onstrated to record cellular signals but only 
for a limited number of events in mammalian 
cells, such as Wnt signaling (8). 
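
The substrate depletion problem can be illustrated with a toy simulation. In the Python sketch below, a barcode is a string of bases, a unidirectional editor converts C to T only, and a reversible editor can also convert T back to C (a simplified stand-in for the A>G conversion on the opposite strand). The per-site editing probability, barcode length, number of generations, and random seed are assumptions chosen for illustration and do not reflect measured editing rates.

```python
import random


def simulate_editing(barcode, generations, edit_prob, reversible, seed=0):
    """Return the number of sites that changed state in each generation."""
    rng = random.Random(seed)
    sites = list(barcode)
    changes_per_generation = []
    for _ in range(generations):
        changed = 0
        for i, base in enumerate(sites):
            if rng.random() >= edit_prob:
                continue  # the editor did not act on this site in this generation
            if base == "C":
                sites[i] = "T"
                changed += 1
            elif base == "T" and reversible:
                sites[i] = "C"  # only the reversible editor can restore a substrate base
                changed += 1
        changes_per_generation.append(changed)
    return changes_per_generation


barcode = "C" * 20
one_way = simulate_editing(barcode, 200, 0.2, reversible=False)
two_way = simulate_editing(barcode, 200, 0.2, reversible=True)

# A one-way editor can record at most one change per substrate base.
print(sum(one_way), sum(two_way))
```

In this model the one-way editor stops recording once every C has been converted, whereas the reversible editor keeps writing new changes for as long as it is active.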

The CRISPR prime editor (PE), fusing a 
catalytically inactive Cas9 to reverse tran- 
scriptase, has recently been proposed for 
scalable, high-content event recording. In 
these genome-editing systems, a PE gRNA 
(pegRNA) encodes a synthetic short payload 
sequence and recruits the PE to the target 
sequence, in which—coupled with endog- 
enous DNA repair machineries—the reverse 
transcription activity of PE enables the inte- 
gration of the payload sequence to the target 
site (7). This enables more diverse data en- 
coding per barcode than wild-type Cas9 and 
BEs. Preliminary data show that the gene 
expression activities of 300 enhancers can be 
recorded to a single target DNA locus of a cell 
population using a library of pegRNAs, each 
of which is encoded downstream of a specific 
enhancer element (9). PE has also been pro- 
grammed for a cascade reaction, where the 
order of complex events can be recorded by 
arraying event-specific payload sequences 
at a target DNA locus like a ticker tape (10).
However, these approaches have not yet been 
implemented in animal models. 
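
The ticker tape idea can be sketched as a simple data structure. In the Python example below, each event type is assigned a short payload sequence, payloads are appended to the tape in the order in which events occur, and the event order is recovered by reading the tape in fixed-length chunks. The event names, the payload sequences, and the three-base payload length are hypothetical; this is a conceptual sketch and not the design of any published recorder.

```python
PAYLOAD_LENGTH = 3  # assumed fixed payload length, in bases


def record_event(tape, event, event_to_payload):
    """Append the payload of an event to the end of the tape."""
    return tape + event_to_payload[event]


def decode_events(tape, event_to_payload):
    """Read the tape in payload-sized chunks and map each chunk back to its event."""
    payload_to_event = {payload: event for event, payload in event_to_payload.items()}
    chunks = [tape[i:i + PAYLOAD_LENGTH] for i in range(0, len(tape), PAYLOAD_LENGTH)]
    return [payload_to_event[chunk] for chunk in chunks]


# Hypothetical event-specific payloads.
payloads = {"wnt": "ACG", "hypoxia": "TTA"}

tape = ""
for event in ["wnt", "hypoxia", "wnt"]:
    tape = record_event(tape, event, payloads)

print(tape)                           # ACGTTAACG
print(decode_events(tape, payloads))  # ['wnt', 'hypoxia', 'wnt']
```

Because the position of each payload on the tape encodes when it was written, the order of events is preserved as well as their identity.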
High-content event reconstruction necessi- 
tates high-capacity scRNA-seq. A cell lineage 
recording system that confers at least one 
unique mutation per cell division would yield 
the perfect reconstruction of cell lineage if all 
cells of a body can be analyzed. Analysis of a 
limited number of cells enables the historical 
reconstruction of only an upstream cell lin- 
eage whose descendant information can be 
covered by the analytes. The current state-of- 
the-art scRNA-seq can process multiple mil- 
lions of cells, whereas a mouse body consists 
of multiple billions of nucleated cells. Given 
the rapid development of scRNA-seq since
the advent of the droplet-based methods that
enabled the analysis of multiple thousands of
cells (11), it is not unreasonable to predict that
technologies that can capture a greater magnitude
of cells will appear in the next 5 years.
Additionally, spatial RNA-seq has enabled
spatial profiling of gene expression patterns
in a tissue section or a three-dimensional
block in high resolution (12). Coupled with
DNA event recording systems whose readout
information is converted into RNA, this modality
also enables the reconstruction of cell
lineage and cell state trajectories throughout
a physical space. However, spatial RNA-seq
allows only the reconstruction of later events
that contributed to its formation. Therefore,
more-scalable methods need to be developed
to capture the whole developmental process
of an animal body.

The DNA event recording concept

Cell-embedded synthetic CRISPR-Cas information-writing enzymes can progressively record cellular events and
lineage transitions into DNA tapes (or barcode arrays) by introducing various types of DNA alteration. DNA tapes are
read out by single-cell sequencing, and the biological event histories are computationally reconstructed. Cell lineage
can be recorded by continuous accumulation of random mutations into DNA tapes. Other event recording systems
necessitate editing event-specific DNA tape positions or inserting specific DNA payloads into the DNA tape.
gRNA, guide RNA; pegRNA, prime editor gRNA.

High-performance computing methods
also need to be developed with these engineering
efforts. Although many of the
current cell lineages have been generated
by repurposing existing evolutionary tree
reconstruction tools, their algorithms need
to be optimized for information recording
biases in specific lineage tracing methods
(13). Efficient algorithms for other biological
events have yet to be established.
Furthermore, no current phylogeny estimation
software tool can reconstruct a lineage
of more than one million inputs, raising
concerns about how to reconstruct a biological
history forming a mammalian body
of at least hundreds of millions of cells. A
new computing framework called deep distributed
computing has been proposed to
overcome this limitation (14). This framework
has enabled accurate cell lineage
tracing of more than two hundred million
mutated sequences. In brief, starting from
a single computing node, it reconstructs an
upstream lineage and assigns all the input
sequences into its subclades, where an independent
computing node recursively resolves
the next upstream lineage of each
subclade. Like a spreading virus, a single
computing task autonomously generates
many distributed computing tasks, each of
which iterates the same procedure. There
is no conceptual barrier to implementing
other event reconstruction processes into
this framework.
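
The recursive structure described above can be sketched in a few lines of Python. In this simplified stand-in, each call splits its input cells into two subclades by the most common edit that is carried by some but not all cells, and then resolves each subclade with a further call; in a distributed setting, each recursive call would be dispatched to an independent computing node. The barcode encoding (tuples of integers, 0 for unedited), the splitting heuristic, and the example cells are assumptions for illustration and do not reproduce the published framework.

```python
from collections import Counter


def split_into_subclades(cells):
    """Split cells by the most common edit carried by some, but not all, of them."""
    counts = Counter(
        (site, state)
        for barcode in cells.values()
        for site, state in enumerate(barcode)
        if state != 0
    )
    informative = [(n, edit) for edit, n in counts.items() if n < len(cells)]
    if not informative:
        return None  # no edit distinguishes these cells any further
    _, (site, state) = max(informative)
    with_edit = {name: bc for name, bc in cells.items() if bc[site] == state}
    without_edit = {name: bc for name, bc in cells.items() if bc[site] != state}
    return with_edit, without_edit


def resolve_lineage(cells):
    """Recursively resolve a lineage; each call stands in for one computing task."""
    if len(cells) == 1:
        return next(iter(cells))
    subclades = split_into_subclades(cells)
    if subclades is None:
        return tuple(sorted(cells))  # unresolved group of indistinguishable cells
    # In a distributed implementation, each subclade would go to its own node.
    return tuple(resolve_lineage(subclade) for subclade in subclades)


# Hypothetical four-site barcodes observed in four descendant cells.
observed = {
    "A": (1, 2, 0, 0),
    "B": (1, 2, 3, 0),
    "C": (1, 0, 0, 4),
    "D": (1, 0, 5, 4),
}
print(resolve_lineage(observed))  # (('D', 'C'), ('B', 'A'))
```

Because every task handles only its own subclade, no single node ever needs to hold the full tree, which is the property that allows the approach to scale.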

Technologies for high-resolution DNA 
event recording have been maturing, which 
could expand the horizons of biology in dif- 
ferent animal models—perhaps starting from 
mice for mammalian species. It is expected 
that this approach will identify unknown cell 
differentiation behaviors and their underly- 
ing molecular and cellular regulatory mecha- 
nisms. Analysis of many individuals’ develop- 
mental processes in a single species in high 
resolution would characterize deterministic 
and stochastic cell differentiation patterns 
and quantify the ability of cells to undergo 
state transitions, enabling more precise com- 
putational modeling of dynamic multicel- 
lular systems with biophysical perspectives. 
Direct observation of human developmental 
procedures with the same approach is im- 
possible because of ethical considerations 
regarding the genetic manipulation of hu- 
man embryos. However, high-resolution de- 
velopmental maps of diverse nonhuman spe- 
cies and comparative genomics approaches 
could also reveal human developmental 
processes. Furthermore, as seen in the emer- 
ging CRISPR cell lineage tracing studies 
of human organoid development (15) and
tumor progression to metastasis in a mouse 
model (5, 6), embedding a high-content DNA 
event recorder system in human pluripotent 
stem cells for organoid studies and in cancer 
cells for animal xenograft studies can more 
directly contribute to understanding their 
behaviors. Combined with genome-editing 
technologies that can reconstitute disease 
mutations in the human genome, these new 
model systems will also serve as testbeds 
with substantially expanded data monitoring 
abilities in therapeutic development for gen- 
etic disorders and cancers. 
MARINE BIOLOGY


Did pollination exist before plants?


Research shows that seaweeds depend on crustaceans for fertilization


By Jeff Ollerton and Zong-Xin Ren


The sexual reproduction of seed
plants involves the transfer of male
gametes (in pollen) to their female
gametes. In flowering plants (angiosperms),
this is performed with the
stigma of flowers, whereas the gymnosperms
(such as conifers and cycads) produce
a diversity of structures on their reproductive
axes to accomplish the same task.
This transfer of male gametes is generally 
known as “pollination” and can be mediated 
by animals, wind, or water. Animal pollina- 
tion, principally by insects, is the dominant 
strategy for angiosperms but also occurs in 
the extant gymnosperms, as well as some 
species of mosses. Outside of these plants, 
no other group of organisms has been dem- 
onstrated to interact with animals in this 
way, until now. On page 528 of this issue, 
Lavaut et al. (1) demonstrate that a living 
species of red seaweed, Gracilaria gracilis, 
uses the isopod crustacean Idotea balthica
to transfer its male gametes, substantially 
extending the phylogenetic scope of species 
that use animals as pollinators. 

Although they photosynthesize, seaweeds 
are a type of algae and therefore are only 
very distantly related to what is considered 
true plants. The group to which the red sea- 
weed G. gracilis belongs is thought to have 
evolved around 1 billion years ago, long 
before plants appeared (2). The isopods 
are not as old, but nonetheless there have
been at least 300 million years for relation- 
ships to have developed with the seaweed. 
It is possible that before this, red seaweeds 
relied on some other now-extinct marine 
invertebrates as pollinators, but the earliest 
unambiguous arthropods only appear about 
600 million years ago. How these seaweeds 
were reproducing before this is a mystery, 
but Lavaut et al. suggest that reproduction 
in the seaweed is “ambophilous,” that is,
its male gametes can be transported by the 
isopods or by water currents. This finding 
is worth noting because extant seed plants 
that rely on both wind and animal pollina- 
tion are widespread, and the strategy is as- 
sumed to be underreported (3, 4). 

What distinguishes the 6750 described 
species of Florideophyceae—of which G. 
gracilis is a member—from most other 
seaweeds is that their male gametes lack a 
flagellum to provide movement. It was pre- 
viously thought that they could reach the 
female reproductive structures passively 
with water currents, a common strategy 
used by some aquatic flowering plants (5). 
However, Lavaut et al. demonstrate that iso- 
pod crustaceans may be attracted to these 
seaweeds because they provide a habitat in 
which to shelter from predators and feed on 
small algae that colonize the surface of the 
seaweed. In addition to pollinating, grazing 
by the isopods removes these algae and may 
increase the growth rate of the seaweed. In 
this way, the isopods are engaged in a “dou- 
ble mutualism” (6) with the seaweed. 

Lavaut et al. demonstrate the involvement of the
crustacean Idotea balthica (purple) in the transport of
male gametes of the red seaweed Gracilaria gracilis
(green). This echoes the assisted pollination of flowering
plants, which raises questions about the ecological
importance and the evolution of these interactions.

Interactions between different species are
the engine that makes ecosystems function
effectively. Herbivores grazing on plants,
predators consuming animals, decomposers
breaking down organic matter and recycling
it all contribute to the ultimate circular
economy of the planet. Ecologists
consider the outcomes of these interactions 
in terms of which species benefit, which 
are negatively affected, and which have a 
neutral outcome with respect to their evo- 
lutionary fitness—that is, the ability to re- 
produce. Mutualistic interactions in which 
both species benefit are often responsible 
for maintaining populations because of 
their positive effects on individual repro- 
ductive success. Mutualism plays a crucial 
role in ecology, from gut microbes that fa- 
cilitate the digestion of cellulose in herbi- 
vores, to fish and sea anemones that protect 
each other from their respective predators, 
to plant-pollinator relationships (7).

Relying on a third party to facilitate reproduction
is obviously a vital way for organisms
that cannot move around to find
mates. Although it was once considered the 
hallmark of flowering plants, researchers 
are discovering that this kind of interaction, 
in which animals bring together different 
gametes or mating types of sedentary or- 
ganisms, may be found in taxa that evolved 
long before angiosperms and gymno- 
sperms, such as mosses (8) and fungi (9, 10). 
The study by Lavaut et al. has broadened 
both the variety and the history of animal- 
mediated male gamete transfer, taking the 
concept of pollination from plants to algae 
and potentially pushing it back to the earliest
evolution of marine invertebrates.

The reliance on an animal to ensure re- 
production is not without its risks, espe- 
cially in a world of rapid anthropogenic 
change. As Lavaut et al. point out, most 
reproduction in this seaweed occurs in 
the relatively still waters found in coastal 
rock pools at low tide. These habitats are 
threatened by factors such as water pollu- 
tion, invasive species, physical modifica- 
tion of the coastline, and changing climate 
and sea level (11). In addition, crustaceans
in general are susceptible to the increasing 
ocean acidification related to rising atmospheric
CO2 concentration. Ocean acidity
can weaken their exoskeletons through 
decalcification, although this effect has 
yet to be thoroughly examined for isopods 
(12). Seaweeds, including Gracilaria graci- 
lis, are also ecologically important for the 
ecosystems in which they occur and are 
increasingly being touted as a sustainable 
food source (13, 14). Marine environments 
are under huge anthropogenic pressure, 
however, and Lavaut et al. demonstrate the 
importance of preserving not just species 
within ecosystems but also the interaction 
networks among different species. 

There is so much work still to be done in 
this area of ecology and discoveries to be 
made. Although flowering plants are the most
extensively studied group of plants, recent estimates
suggest that only 10% of the more than
300,000 known species of animal-pollinated
flowering plants have had their pollinators
documented (15). There are no doubt
many more revelations awaiting the careful 
observer of species interactions. 
MOLECULAR BIOLOGY


The material properties of chromatin in vivo


Chromatin is fluidlike within the crowded nucleus when probed in a living cell


By Woong Young So and Kandice Tanner 


The physical organization and material
properties of the genome can regulate
gene expression and genome stability
(1, 2). On page 489 of
this issue, Keizer et al. (3) report that 
chromatin behaves like a fluid in the 
nucleus of a living cell. Within minutes of ap- 
plying magnetic forces directly to a marked 
locus within the genome of live cells, the au- 
thors observed displacement by several mi- 
crometers. Drawing on theoretical models in 
polymer physics, they determined that chro- 
matin behaves as a free polymer in a viscous 
environment, with moderate interactions 
from the surrounding chromatin. These find- 
ings suggest that chromatin has a higher de- 
gree of flexibility than if it were to behave like 
a solid or gel-like material during interphase, 
as currently proposed (4-7). These results 
have implications for overall compaction of 
the genome and the coupling between me- 
chanical and biochemical signaling needed 
for genome integrity and function.

Magnetic forces have previously been
used to probe chromatin directly in living 


cells (4, 7). However, these studies were un- 
able to resolve single-chromatin dynamics. 
Keizer et al. solve the technical challenges 
of directly probing the mesoscale mechani- 
cal properties of chromatin in the context 
of local crowding in the surrounding nu- 
cleoplasm using magnetic nanoparticles 
as local sensors. Recombinant enhanced 
green fluorescent protein (eGFP)-labeled 
ferritin cages encapsulating a magnetic 
core were functionalized with a dimerizing 
protein construct of mCherry and an anti-GFP
nanobody fused with the tetracycline repressor
(TetR). They were injected into the nuclei 
of cells that were engineered to contain 
numerous tetracycline operator (TetO) se- 
quences within the genome. Upon docking 
with TetO, dimerization of the TetR facili- 
tated binding to the locus (see the figure). 
These cells were cultured on magnetic micropillars
that served as conduits of subpiconewton
forces after the application of
an external magnetic field. These techni- 
cal innovations supersede previous studies 
where the probes, a micrometer in diam- 
eter, were orders of magnitude larger than 
genomic loci (4). The application of forces
is also more physiologically appropriate
than previously described; subpiconewton
forces are associated with RNA polymerase
II transcription and with motor proteins
within the nucleus (4).

Probing chromatin behavior

Keizer et al. used an innovative probe to measure the mechanical properties of chromatin. A dimerizing protein
construct of mCherry and anti-green fluorescent protein (GFP) nanobody fused with tetracycline repressor
(TetR) binds to tetracycline operator (TetO) sequences within the genome. The addition of ferritin-enhanced
GFP magnetic nanoparticles allows for movement of the chromatin with magnets. This reveals fluidlike
behavior of chromatin (left) compared with unperturbed chromatin in the absence of the magnetic field (right).

Chromatin must undergo conforma- 
tional changes during and after replica- 
tion, as observed during mitosis, rear- 
rangements during meiosis, and structural 
folding as the cell cycle progresses (7). 
Conformational changes are also observed 
during nuclear deformations, as observed 
when immune and cancer cells traverse 
distinct tissues (8). Local chromatin dy- 
namics influence how transcription fac- 
tors dock to target sequences and regulate 
long-range interactions between distant 
sequences (7). The observation that chro- 
matin is fluid prompts reexamination of 
the biophysical principles underpinning 
these processes. 

Two discoveries reported by Keizer et al. 
require the revisiting of existing ideas. That 
subpiconewton forces are sufficient to phys- 
ically drag a locus within the genome across 
several micrometers in a crowded nucleus 
directly contradicts a previous finding that 
chromatin remains confined with submi- 
crometer displacements after applications 
of larger forces (4). Possible explanations to 
reconcile these findings are the differences 
in the size of the probe and the amount of 
time that the forces were applied. Keizer et 
al. applied weaker forces for several min- 
utes, whereas the previous experiment in- 
volved probes that were ~1000-fold larger 
and applied higher forces for 100-fold 
shorter time scales (4). Also, the labeling 
technique in the current study relies on the 
insertion of an artificial locus, whereas a 
native locus might reveal different flexibili- 
ties. Advances in CRISPR-mediated editing 
could be used to endogenously label chro- 
matin to reconcile these findings. 

Laboratory of Cell Biology, Center for Cancer Research,
National Cancer Institute, National Institutes of Health,
Bethesda, MD, USA. Email: kandice.tanner@nih.gov

The results reported by Keizer et al. also
challenge the view that chromatin behaves
as a stiff, cross-linked polymer gel (3-6).
Polymers, including chromatin, are often
depicted as linearized assemblies of individual
units covalently linked in a Gaussian
chain (9). These chains can be modeled as
beads connected with harmonic springs.
Instead, the authors observed that chromatin
has fluidlike properties and behaves as
a free polymer. The Rouse model is usually
used for concentrated polymer solutions,
where a drag force can be used to understand
forces on the polymer as it undergoes
motion with minimal contributions resulting
from hydrodynamics (9). In the study
of Keizer et al., the observed dynamics
suggested that there are moderate contri- 
butions of hindrance, topology, and cross- 
linking because of interactions with other 
chromatin regions in a crowded nucleus. 
One explanation might be that chromatin
contains subdomains, where gel-like domains
in a mixture of states are
interspersed within the larger liquid structure.
An additional explanation might be
that the strength of the cross-linking is 
weak as a result of short-lived cross-links 
within the network of chromatin chains 
and associated factors. Thus, heterogene- 
ity in the material properties might exist 
at different length scales. This may further 
clarify the differential transcriptional activities
observed for the two forms of chromatin
found within the nucleus (euchromatin
and heterochromatin) (1, 2).
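
The bead-and-spring picture mentioned above can be made concrete with a minimal numerical sketch. The Python code below simulates a one-dimensional chain of beads joined by harmonic springs, in which each bead feels only viscous drag (no hydrodynamic coupling, as in the Rouse model) and a constant force pulls on one bead. Thermal noise is omitted, and the chain length, spring constant, friction coefficient, force, and time step are arbitrary illustrative values; this is not the analysis performed by Keizer et al.

```python
def simulate_pulled_chain(n_beads=21, pulled=10, force=1.0, spring_k=1.0,
                          friction=1.0, dt=0.05, n_steps=2000):
    """Overdamped bead-spring chain with a constant force applied to one bead.

    Returns the pulled bead's position after each step and the final positions.
    """
    x = [0.0] * n_beads
    trajectory = []
    for _ in range(n_steps):
        new_x = []
        for i in range(n_beads):
            spring = 0.0
            if i > 0:
                spring += spring_k * (x[i - 1] - x[i])  # pull from left neighbor
            if i < n_beads - 1:
                spring += spring_k * (x[i + 1] - x[i])  # pull from right neighbor
            external = force if i == pulled else 0.0
            # Overdamped dynamics: velocity = total force / friction coefficient.
            new_x.append(x[i] + dt * (spring + external) / friction)
        x = new_x
        trajectory.append(x[pulled])
    return trajectory, x


trajectory, final_positions = simulate_pulled_chain()
center_of_mass = sum(final_positions) / len(final_positions)
print(trajectory[-1], center_of_mass)
```

In this model the pulled bead keeps moving for as long as the force is applied and drags its neighbors along, behavior that is fluidlike; in a cross-linked gel, by contrast, the displacement would saturate.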

Phase separation has been shown to 
drive the formation of heterochromatin 
domains (10). The dynamics of integrated
loci that are fluidlike may also be impor- 
tant for phase separation of chromatin 
in vivo. Further studies involving the ma- 
nipulation of additional chromosomes us- 
ing endogenous probes will address the 
generalizability of the findings. The study 
of Keizer et al. also opens possibilities 
to mechanically manipulate gene loci in 
vivo. The authors cultured cells on pillars. 
However, mechanotransduction of longer-range
forces, such as shear stress, differential
tissue rigidity, or osmotic stresses,
influences local chromatin dynamics through
direct cross-talk between the nucleus and
the extracellular milieu (11). Thus, coupling
these technical advances with culture
models, such as three-dimensional
organoids, can be used to address the 
knowledge gap between micro- and meso- 
scale behaviors. 
Engineering defects with DNA


Single-walled carbon nanotubes are structurally modified by using a genetic sequence


By YuHuang Wang 


Strategically introduced defects can
be used to modify carbon nanotubes 
for new properties and functions. 
For example, chemical defects can 
act as atomic traps for electrons, 
holes, electron-hole pairs, and even 
molecules and ions (1). The ability to control
the placement of these defects on carbon
nanotubes could enable a plethora of
fundamental studies and potential appli- 
cations in imaging, sensing, disease diag- 
nostics, and quantum information science 
(2-4). However, there is not yet an effective
way to do this with atomic precision (5-7). 
On page 535 of this issue, Lin et al. (8) re- 
port the creation of ordered defect arrays 
by programming DNA strands to wrap 
around and traverse the entire length of a 
single-walled carbon nanotube. 
Modifications of single-walled carbon 
nanotubes are typically random, as all 
reactive sites on the extensive hexagonal 
lattice of the carbon nanotube are effec- 
tively the same (7). Lin et al. show that 
it is possible to program the placement 
of defects on the carbon lattice by using 
single-stranded DNA. In one example, 
they use a specific DNA strand made of 
guanine (G) and cytosine (C), with the 
sequence CCCGCCCCCCCGCCC, to wrap 
around a single-walled carbon nanotube, 
with each of the two guanines covalently 
linked to the nanotube forming an atomic 
defect (see the figure). The two defect sites 
are separated by five carbon-carbon bonds 
along a helical line of the hexagonal lattice
known as the “armchair line,” which
travels down the nanotube similar to the 
threads on a screw. The process created 
ordered arrays of defects and pinned mul- 
tiple DNA strands along the armchair line. 
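
As a small illustration of how the reactive sites are encoded in the strand, the Python sketch below locates the guanines in the sequence quoted above and reports their spacing in nucleotides. The helper name and the zero-based indexing are choices made here for illustration; relating nucleotide spacing to distances on the nanotube lattice requires structural data and is not attempted.

```python
def guanine_positions(sequence):
    """Return the zero-based positions of guanine (G) in a DNA sequence."""
    return [i for i, base in enumerate(sequence) if base == "G"]


reactive_strand = "CCCGCCCCCCCGCCC"  # sequence quoted in the text
positions = guanine_positions(reactive_strand)
spacing = positions[1] - positions[0]

print(positions, spacing)  # [3, 11] 8
```

The two guanines are separated by seven cytosines, so changing the number of intervening bases is the natural design parameter for tuning where the second defect lands.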
Lin et al. observed this transformation
using cryo-electron microscopy (cryo-EM), 
which measures the shadows of a macro- 
molecule at many different orientations 
to reconstruct its three-dimensional (3D) 
structure. In their experiment, reconstruc- 
tions from the cryo-EM images of 44,393 
DNA-functionalized carbon nanotubes re- 
vealed that the single-stranded DNA forms 
a helical pitch of 6.5 Å, which closely
matches the periodicity of the aforemen- 
tioned armchair line of the carbon nano- 
tube lattice. For comparison, this order- 
ing is not observed in the controls. The 
formation of ordered defect arrays is also 
evidenced by spectral signatures that are 
strongly dependent on the spatial distribu- 
tion pattern of the defects. When the two 
bonding G’s are spaced at a distance that
matches the helical pitch, the disorder-induced
Raman scattering is minimized.
Consistently, the defect photoluminescence
peak shows minimum spectral shift
and width at the same spacing between
the G’s and reveals a picture of populating
electron-hole pairs, or excitons, between
shallow defect traps in an array (9).

Programming atomic defects on a carbon nanotube
By incorporating reactive guanines into a DNA sequence (orange), Lin et al. create ordered arrays of defects
(red) at a pitch of ~6.5 Å along a helical line (blue) on the hexagonal lattice of a carbon nanotube (gray).

Although Lin et al. succeeded in designing
a complex procedure for creating these
defects, the exact mechanism behind the
ordering remains somewhat of a mystery.
The nanotubes are initially dispersed and
stabilized individually in an aqueous solution
by using a different DNA sequence
(TTTCCCTTTCCCCCC; T is thymine),
which is later displaced by bonding the
G-containing sequence to the nanotube
using 525-nm light to trigger the reaction.
This guanine photochemistry creates covalent
defects on single-walled carbon nanotubes,
but the ordering of those defects
was not previously observed (6). For these
covalent defects to occur in an ordered array,
the bonding reactions must occur in a
way that avoids locking into a disordered
configuration. To this end, the use of a
water-methanol solvent might have allowed
the defects to be created, as the
methanol unwraps the dispersing DNA
from the nanotube, making space for the
reaction. Further research will be needed
to decipher the mechanism of the ordering,
which is important for designing and guiding
the synthesis of these defect arrays.

It is also worth highlighting the successful
use of cryo-EM for resolving the oligonucleotides.
Although cryo-EM has been
powerful in determining the structure of
proteins and other large biomolecules in 
the past, smaller molecules have typically 
been challenging for cryo-EM to resolve 
because of the low signal-to-noise ratio. 
The success here may be attributed, at 
least in part, to the nanotubes serving as 
an atomically precise ruler and reference 
for the 3D reconstruction. This “hack” may 
provide an exploit if it can be applied to 
resolving other small molecules that pos- 
sess this property. 

With a reliable method to create an or- 
dered array of carbon nanotube defects, 
researchers may explore its applications— 
e.g., engineering defects as quantum traps 
to harness electron spins, excitons, trions, 
and single photons for a wide range of ap- 
plications, such as in quantum chemical 


sensing (1). These traps, known as “sp²”
defects, are considerably shallower, and
thus dimmer in photoluminescence, than
“sp³” defects, in which each carbon atom
at the defect site is bonded to four instead 
of three other atoms (10). Although the 
weaker defect photoluminescence may 
limit the applications of these defects in 
areas such as imaging (2) and quantum 
light sources (3), their transport properties 
may be especially useful for a different as- 
sortment of applications. One such oppor- 
tunity is for the experimental verification 
of the model for organic superconductors 
proposed more than 50 years ago by physicist
Bill Little (11), in which he postulated
that a room-temperature superconductor 
can be realized in a wire functionalized by 
ordered arrays of polarizable side chains 
that attract electrons. The finding by Lin 
et al. may provide a way to synthesize the 
wire imagined by Little and put the pre- 
diction to the test. The ability to program 
these defects along the length of a nano- 
tube may also help create the so-called 
“Heisenberg spin chain,” which can theo- 
retically function as a chain of entangled 
quantum bits, with potential applications 
in quantum information processing and 
computing (12). 

To realize these ambitious visions, fu- 
ture research efforts must expand and cap- 
italize on the vast design space of DNAs, 
defect chemistry (1, 13), and precision 
nanotubes (14); enable deeper traps (10); 
and increase the size of the defect array. 
With the defect ordering demonstrated in 
this work, Lin et al.’s discovery (8) marks 
an important step, and the rush to order 
has started for defects. 


POLICY FORUM 


GLOBAL HEALTH 


A treaty to break the pandemic cycle 


An evidence-based treaty must balance prevention, preparedness, response, and repair 


By Alexandra L. Phelan and 
Colin J. Carlson 


World Health Organization (WHO) 
member states have tasked an 
Intergovernmental Negotiating 
Body with developing an initial 
draft of a “pandemic treaty” by 
August 2022, with the goal of 
adoption of an agreement by May 2024. 
There are multiple proposals for the spe- 
cific aims that such an international legal 
instrument for pandemic preparedness 
and response should incorporate. One un- 
satisfying solution would be to relegate the 
pandemic treaty solely to matters of human 
health as much as possible. International 
law, including the International Health 
Regulations (IHR), has typically operated 
in these types of fragmented, thematic si- 
los. This untenable approach ignores key 
lessons from the past 2 years, and well 
before. A better option is a treaty that 
(pro)actively brings cohesion to interna- 
tional law and the haphazard governance 
that left the world vulnerable to both the 
risks and impacts of COVID-19. 
Considerable scientific evidence sug- 
gests that the world is trapped in a positive- 
feedback loop: Disease spillovers from 
animals to humans become outbreaks that 
then become pandemics, creating many 
catastrophic impacts that can reduce re- 
siliency and increase the socioecological 
drivers of spillovers. Pandemics connect 
to every aspect of society, both driven by 
and spilling over into politics, the environ- 
ment, economics, and systematic inequali- 
ties and injustices. This imposes a substan- 
tial burden on the negotiation process, as 
stakeholders lobby for the treaty to be all 
things for all interests. Left unchecked, 
this might result in a treaty that fails at 
its primary purpose: preventing, preparing 
for, and responding to pandemics. 
Closing the “pandemic gap” requires 
evidence-based solutions that break every 
step of the cycle, including reducing spill- 
over risk, reducing pandemic risk, re- 
ducing pandemic impacts, and ensuring 
recovery and resilience. We identify 12 
elements for a cohesive, transformative, 
and evidence-based treaty (see the box). 
We describe how these elements could be 
translated into provisions under the treaty, 
with a focus on principles of good gover- 
nance, distributive justice, and human 
rights and coherence within international 
law. A treaty that balances these aims will 
move global health governance beyond the 
limited scope of the IHR and provide clar- 
ity and complementarity to other relevant 
international legal regimes. 


SPILLOVER PREVENTION 

Pandemics start in nature. Stopping some 
at the source could be cost effective, and 
many proposed solutions would inciden- 
tally benefit conservation and other envi- 
ronmental causes. 


Planetary health solutions 

Anthropogenic environmental change 
drives poorer baseline human and ani- 
mal health and is increasing the rate of 
viral spillover. To date, land use has been 
the biggest upstream driver, linked to 
18% of emerging disease outbreaks since 
1940 (1). Within the next decade, it will 
be overtaken by climate change, even if 
greenhouse gas emissions are reduced 
(2). Global environmental issues must 
be embedded throughout the treaty. This 
includes preambular language that ex- 
pressly recognizes the roles that climate 
change, biodiversity loss, land degrada- 
tion, and wildlife trade play in pandemic 
risk. Other elements of the treaty may also 
support environmental aims: For exam- 
ple, obligations to build sustainable pri- 
mary health care systems may help reduce 
deforestation and meet greenhouse gas 
emissions targets (3). To formalize coben- 
efits, a treaty can expressly interface with 
existing obligations under other interna- 
tional laws, such as the Convention on Bi- 
ological Diversity (CBD), the Convention 
on International Trade in Endangered 
Species (CITES), and the United Na- 
tions Framework Convention on Climate 
Change (UNFCCC). 


One Health solutions 

Outbreaks are multicausal: A recent analy- 
sis of 41 socioecological risk factors (such 
as mining, hunting, and travel) showed 
that in the 100 largest zoonotic disease 
outbreaks since 1975, no one driver could 
be linked to more than 13% of events (4). 
Singular solutions that only target one 
aspect of the wildlife-livestock-human 
interface are therefore unlikely to suc- 
ceed at scale. Taking lessons from multi- 
lateral environmental agreements, and in 
coherence with states’ existing obligations 
in relation to the World Organisation for 
Animal Health (WOAH, formerly OIE) 
and the Food and Agriculture Organiza- 
tion (FAO), the treaty should provide flex- 
ibility for locally tailored risk assessments 
and intervention strategies while instilling 
obligations to conduct animal disease sur- 
veillance and fund a stable, well-trained 
One Health workforce (integrating hu- 
man, animal, and environmental health). 
Historically, these investments have been 
focused in tropical countries and around 
high-risk interfaces like wildlife markets 
and agriculture; a forward-looking pan- 
demic treaty must expand this approach, 
cognizant of scientific evidence that, par- 
ticularly in a changing climate, zoonotic 
emergence is a global risk (2). 


Zoonotic risk assessment 

After facing five influenza pandemics in less 
than a century, global strategies have been 
established to sound the alarm when new 
threats emerge. COVID-19 underscores the 
need to broaden this approach: Although 
flu is the most persistent pandemic threat, 
there are hundreds of coronaviruses, para- 
myxoviruses, poxviruses, flaviviruses, and 
others that will cross the species divide in 
the coming century. Only a small fraction of 
spillovers are likely to become pandemics; 
the task of risk assessment therefore relies 
on both surveillance (including viral dis- 
covery in animals, syndromic surveillance 
in humans, and monitoring at the inter- 
face between them) and science (including 
both experimental characterization and 
computational prediction). Establishing no- 
tification and data-sharing obligations for 
animal disease outbreaks and One Health 
discoveries of public interest, coupled with 
both risk assessment and benefit sharing 
provisions, would support global efforts to 
identify potential zoonoses, develop counter- 
measures like universal vaccines, and moni- 
tor for spillover. 


PANDEMIC PREVENTION 

Even with upstream mitigation, spillover 
remains a persistent problem. The primary 
aim of a pandemic treaty must be to prevent 
outbreaks from becoming pandemics. 


Surveillance and assessment 

Infectious disease surveillance is severely 
limited by funding, workforce, and tech- 
nological capacity. Genomic epidemiology 
is vital both to identify outbreaks and to 
assess the risk of human-to-human spread 
(5). Under the IHR, there is no express 
obligation to share pathogen samples or 
genomic sequence data (GSD), even after 
notification of a potential public health 
emergency of international concern. The 
pandemic treaty must consider the balance 
of obligations to share data, obligations on 
high-income countries to provide capacity 
building and technology transfer, and obli- 
gations to equitably share the benefits from 
the use of GSD, including not only risk as- 
sessments but also diagnostics, vaccines, 
and treatments. A treaty may also support 
innovation in disease detection (including 
wastewater surveillance for both new and 
ongoing outbreaks) and real-time digital 
sharing of epidemiological data. 


Biomedical R&D and production 

Under one estimate, $24 billion would be 
sufficient to develop prototype vaccines for 
every family of viruses that can infect hu- 
mans (6). With biomedical investments like 
these, and international initiatives like the 
Coalition for Epidemic Preparedness Inno- 
vations, scientific advances could lead to 
game-changing vaccine platforms and tech- 
nologies, potentially including universal 
vaccines. For production and deployment 
to keep pace with these technological ad- 
vances, the pandemic treaty must address 
the substantial legal and political impedi- 
ments to technology transfer and capac- 
ity building. This includes taking lessons 
from other regimes, including international 
climate law, for financing and technology 
mechanisms to build capacities and fa- 
cilitate hard and soft technology transfer. 
A treaty can set priorities and establish 
an end-to-end research and development 
(R&D) platform, predictable financing, and 
regional platforms and engage industry and 
states in prenegotiated contracts and agree- 
ments from R&D to production (7). 


Health systems strengthening 

Even before vaccination began, health care 
disparities made COVID-19 twice as deadly 
in low- and middle-income countries (8). 
The goal of achieving both universal health 
coverage (UHC) and global health security 
must form the foundation of a pandemic 
treaty if it aims to prevent outbreaks from 
becoming pandemics. The treaty must 
explicitly acknowledge the importance 
of health systems strengthening (HSS) 
in achieving both UHC and global health 
security and use a “right to health” ap- 
proach to ensure that health systems are 
accessible, available, acceptable, and of 
sufficient quality to all. This should include 
express commitments to HSS and UHC, in- 
cluding financial commitments: Countries’ 
investment choices will account for an 
870-million-person margin of uncertainty 
in projected health coverage by 2030 (9). 
To achieve the best outcomes possible, a 
treaty must include a funding mechanism 
with a target of meeting the WHO’s exist- 
ing goal of 1 billion more people covered 
by UHC. This would not only be funda- 
mental to prevent spillovers from becom- 
ing pandemics but would bring a suite of 
health cobenefits, like reducing maternal 
and child mortality and the burden of ne- 
glected diseases, as well as reducing pan- 
demic impacts. 


Twelve elements of a pandemic treaty 

A successful treaty must balance several 
aims between prevention (reducing 
both spillover and pandemic risk) and 
response (reducing pandemic impacts 
and improving recovery and resilience). 

Reduce spillover risk 
* Planetary health solutions (global) 
* One Health solutions (local) 
* Zoonotic risk assessment 

Reduce pandemic risk 
* Surveillance and assessment 
* Biomedical R&D and production 
* Health systems strengthening 

Reduce pandemic impacts 
* Equitable access to global goods 
* Emergency legal preparedness 
* Least restrictive measures 

Recovery and resilience 
* Adaptive governance 
* Accountability and transparency 
* Reduce inequities and injustice 


PANDEMIC RESPONSE 

As of now, there is no international law that 
sufficiently governs a pandemic response, 
leading to the kinds of dysfunction wit- 
nessed during COVID-19. 


Equitable access to global goods 

The treaty must address the global injus- 
tice of the inequitable sharing of benefits 
from the use of pathogen samples and 
GSD, including vaccines, diagnostics, and 
therapeutics seen in current and previous 
pandemics. The treaty is the most critical 
opportunity that we have to establish a mul- 
tilateral mechanism to facilitate this in line 
with global health and equity and consis- 
tent with other areas of international law, 
like the Nagoya Protocol to the CBD. In ad- 
dition, the treaty should limit legal barriers 
to global equitable access, such as export 
controls, and set the terms of use for ad- 
vance purchase agreements, including the 
use of prenegotiated standard terms and 
establishment of a multilateral distribution 
mechanism (7). The treaty is an opportunity 
to dismantle intellectual property law bar- 
riers to equitable access in a forum that is 
more democratically representative of the 
Global South than the World Trade Organi- 
zation, potentially better enabling the pri- 
oritization of global health and equity. 


Emergency legal preparedness 

Law has long been undervalued as a criti- 
cal component of emergency preparedness 
and response. However, legislative and judi- 
cial actions throughout the pandemic have 
left many countries’ public health laws in 
tatters (10). Governments are substantially 
less empowered to protect health now and 
during the next pandemic. Although the 
IHR require States Parties to have enabling 
legislation as part of their core public health 
capacities, the contents of these obligations 
are poorly defined across implementa- 
tion initiatives. It was only this year that 
the Global Health Security Agenda, aimed 
at strengthening global health security 
by building core public health capacities 
worldwide, added a program sufficiently 
dedicated to legal preparedness. Translat- 
ing the IHR’s technical approaches into 
domestic law has been a major challenge, 
partly because domestic laws depend on 
political will for action. Here, a pandemic 
treaty that is not designed exclusively as a 
technical instrument but also carries nor- 
mative weight may be better equipped to 
facilitate the political momentum necessary 
for domestic law reform. 


Least restrictive measures 

Public health and human rights demand 
that governments use the least restrictive 
measures necessary to achieve proportion- 
ate public health outcomes. The pandemic 
treaty must provide the governance neces- 
sary to guide evidence-based public health 
measures and respect for human rights. It 
must also establish mechanisms to moni- 
tor restrictive public health measures used 
during pandemics and ensure accountabil- 
ity of states under global health law (11). 
This includes addressing the use of travel 
restrictions. Scientific evidence indicates 
that international travel restrictions may 
delay the spread of pandemic pathogens 
(12), but they have been applied inconsis- 
tently and discriminatorily and some have 
been more restrictive than necessary. The 
use of travel restrictions has serious im- 
plications for the next pandemic, poten- 
tially disincentivizing rapid notifications 
of new outbreaks. In coherence with the 
IHR and any reforms, the pandemic treaty 
is an opportunity to reconsider the types 
of incentives available for rapid reporting, 
including funding and compliance mecha- 
nisms, as well as for normative commit- 
ments to least restrictive measures at the 
highest levels of government. 


RECOVERY AND RESILIENCE 

A pandemic treaty offers the best, and pos- 
sibly last, chance to rebuild global trust and 
show that multilateralism has the political 
and practical ability to build a safer and 
fairer world. 


Adaptive governance 

Responding to the changing landscape of 
global health will require a treaty to estab- 
lish a regular conference of parties (COP). 
These meetings will facilitate norm-build- 
ing, interpretation consensus, compliance 
assessment, and the negotiation of proto- 
cols if a framework convention model is 
adopted. Learning from the relationship 
between the UNFCCC and Intergovern- 
mental Panel on Climate Change (IPCC) 
[and, similarly modeled on it, the CBD 
and the Intergovernmental Panel on Bio- 
diversity and Ecosystem Services (IPBES)], 
the treaty should also establish an inter- 
governmental science-policy platform for 
pandemic prevention, preparedness, and 
response, which will inform decision-mak- 
ing at COPs. This platform must have a pri- 
mary mandate of evidence synthesis across 
all four quadrants of the pandemic cycle, 
explicitly not in support of any agenda 
beyond consolidation and evaluation of 
scientific literature, with an independent 
public process for government and civil 
society review. Building on this, and tak- 
ing the lessons from the Biological Weap- 
ons Convention, the treaty should build in 
express opportunities for regular review 
conferences, ensuring that the treaty itself 
survives and adapts to changing science, 
technologies, and epidemiology. 


Accountability and transparency 

The COVID-19 pandemic has decreased 
public trust in both scientists and govern- 
ment officials. Treaty elements that pro- 
mote accountability and transparency will 
be essential to its success and good gover- 
nance, establishing the rule of law for state 
conduct. Similar to mechanisms under in- 
ternational climate law, the treaty should 
include an independent compliance body 
tasked with facilitating implementation 
and assistance, conducting verification, 
and promoting compliance through trans- 
parent and regular reporting (13). A range 
of other compliance mechanisms have 
been proposed, from empowering WHO 
or the United Nations Secretary-General 
to conduct investigations to sanctions for 
delayed notifications. However, punitive 
measures risk undermining the trust and 
transparency necessary for public health, 
even at the global scale. Instead, the treaty 
should establish transparency and ac- 
countability through positive incentives. 
Civil society will also be a critical com- 
ponent of holding states accountable for 
their obligations under the treaty, includ- 
ing in any shadow reporting processes, and 
protecting the rights of health care work- 
ers, scientists, and lawyers who serve as 
whistleblowers (14). In recent years, many 
governments have actively limited the 
space for civil society to operate, share in- 
formation, and conduct advocacy in public 
health; to remedy this, the treaty must en- 
shrine recognition of civil society organiza- 
tions and factor in appropriate protections 
and participation in compliance processes. 


Reduce inequities and injustice 

Throughout all of the elements, there must 
be an express commitment by States Par- 
ties to reduce the inequities and injustices 
that make pandemics disproportionately 
affect vulnerable populations, both within 
and among countries. Although this rec- 
ognition should include preambular lan- 
guage that sets the tone for the treaty, it 
must go further. The treaty must contain 
a clear, binding obligation of nondiscrimi- 
nation in the implementation of pandemic 
prevention, preparedness, and response 
measures. The negotiation process, the 
science-policy platform, and the treaty’s 
governance arrangements must all have 
express requirements for equitable geo- 
graphical representation and diversity of 
gender, country income levels, and disci- 
plinarity. Finally, the treaty must not be 
given the opportunity to recapitulate the 
colonial histories of global health and in- 
ternational law. Deliberations about how 
a pandemic treaty enshrines and advances 
the decolonization of global health should 
be central to its conception and led by 
countries and communities that have his- 
torically been excluded from both global 
governance and scientific scholarship. 


BREAKING THE CYCLE 

Breaking the pandemic cycle and disman- 
tling injustice are daunting tasks for a 
treaty, particularly one designed within a 
system that has perpetuated and institu- 
tionalized both. Even this early in negotia- 
tions, many already anticipate that on the 
current trajectory, a treaty will fall short 
of these ambitions (15). Negotiators must 
take these concerns as a clarion call or risk 
far-reaching repercussions for the legiti- 
macy of global health and multilateralism 
beyond the treaty itself. Every remaining 
moment is a once-in-a-generation oppor- 
tunity to negotiate an ambitious and eq- 
uitable treaty and fight for provisions that 
materially improve the health of humans, 
animals, and ecosystems. 


BATTERIES 


The hidden costs of batteries 


Clean energy plans must include safer, more sustainable, 
worker-friendly power storage 


By Benjamin K. Sovacool 


Batteries are everywhere—in our 
mobile phones, our remote con- 
trols, our toys, and increasingly 
in the fleets of electric vehicles on 
our roads. Batteries are ubiquitous 
because they can do many things: 
They can store energy in homes, improve 
the resilience of electric grids, and assist 
with the integration of low-carbon electric- 
ity sources such as wind farms and solar 
photovoltaic panels. 

The reviewer is at the Department of Earth and 

Batteries, argues environmental histo- 
rian James Morton Turner in his new book, 
Charged, are the cornerstone of our mod- 
ern “culture of mobility,” one that depends 
on portable, reliable, and cordless sources 
of energy. In Charged, Turner offers an em- 
inently readable, elegantly precise treatise 
on the topic of batteries. The book consists 
of only four main chapters. An introduc- 
tion describes the role of batteries in our 
modern economy, and a conclusion out- 
lines recommendations for cleaner forms 
of energy. In between, the book includes a 
chapter on lead-acid batteries and cars, a 
chapter on AA batteries and consumer or 
“throwaway” goods, a chapter on lithium- 
ion batteries and smart phones, and a 
chapter on electric mobility. The book even 
offers tips on how to make a homemade 
battery from a lemon. 

Charged is more critical of battery power 
than most other books on this topic, not- 
ing that batteries appear “clean” only 
because they obscure other, more 
discrete, impacts. In the environ- 
mental dimension, batteries de- 
pend on some of the most toxic 
metals and materials in industrial 
supply chains. The list of environ- 
mental insults is long and sober- 
ing: lead, sulfuric acid, mercury, 
manganese, zinc, steel, carbon, 
graphite, ammonium chloride, 
potassium hydroxide, cadmium, 
lithium, nickel, cobalt, and some- 
times rare earth minerals. Batteries are 
thus intimately connected with mining and 
extraction and with energy-intensive pro- 
cessing. When they are used up, they often 
return to the earth in the hazardous form of 
waste or pollution. 

University 0 

Charged 
James Morton Turner 
Press, 2022. 256 pp. 

Taxi drivers in Shenzhen, China, wait for 
their electric vehicles to charge. 

The material economy behind batter- 
ies also has a poignant human and social 
cost. People at both ends of the battery 
supply chain—extraction and waste—must 
handle the toxic materials they contain, ei- 
ther coping with the precarious practices 
of artisanal mining for lithium and cobalt 
or managing burgeoning flows of elec- 
tronic waste at scrapyards. Every step in 
the battery manufacturing process moves 
toxic materials such as lead or cobalt into 
workers’ bodies, notes Turner, “most often 
through what they ate, drank, or, most 
problematically, breathed.” 

These issues prompt Turner to make 
some fairly substantial, and likely 
controversial, recommendations. He be- 
lieves that governments must take the lead in building 
a clean energy future from the ground up. 
He suggests that policies are needed to pro- 
mote more responsible sourcing of battery 
materials and more responsible manufac- 
turing and production of batteries. He ar- 
gues in favor of new and expanded mining 
operations (as well as more sustainable 
refining operations) in the United States. 
Most provocatively, he argues that planners 
should cease “trying to recycle our way to 
a clean energy future” and reject “circular 
economy” ideas in favor of more practical 
materials and supply chain management 
within the battery production process, in- 
cluding more responsible mining practices. 

In elucidating the complex history of 
batteries in our society, the book offers a 
much needed exposé on the human and 
environmental side of proposed low-car- 
bon transitions that depend on batteries or 
energy storage. The book also underscores 
how energy choices are always about more 
than energy technology or resources. They 
intertwine with, and even accelerate, 
practices of automobility, hy- 
perconsumption, digitalization, 
and social media. Moreover, the 
book demonstrates that energy 
security plans and energy policy- 
making are not, at their core, 
adequately addressing the many 
social and environmental dilem- 
mas introduced by batteries. 

The risk, as Turner states at 
the end of the book, is immense: 
“scaling up a clean energy future 
could very easily drive social in- 
equities and concentrate economic power, 
just as the fossil fuel-powered extractive 
economy of the twentieth century did.” But 
so is the ultimate reward for designing safer, 
more sustainable, worker-friendly batteries, 
which could become the foundation for our 
next global energy transition. 

10.1126/science.add0714 


The ascent of Moderna 


From scrappy startup to household name, a journalist 
traces the trajectory of the company and its mRNA vaccine 


By Elie Dolgin 


In early 2020, US government scientists 
were preparing to conduct a drill to de- 
termine how rapidly a new vaccine tech- 
nology could be deployed to combat an 
emerging pathogen. The deadly Nipah 
virus had recently leapt from bats into 
people in southern India, killing nearly ev- 
eryone who was infected. The plan: to see 
how quickly researchers at the National In- 
stitutes of Health, in collaboration with the 
biotech company Moderna, could build a 
vaccine from scratch and launch a clinical 
trial, all using the then-unproven messen- 
ger RNA (mRNA) vaccine platform. 

The drill, however, soon took a backseat to 
a real respiratory virus that had begun sick- 
ening dozens in central China. In a record- 
setting 66 days, Moderna and its government 
partners went from having a genetic se- 
quence of the new coronavirus to a first-in- 
human vaccine trial. A US Food and Drug 
Administration-authorized vaccine followed 
just 10 months later, and Moderna became a 
household name seemingly overnight. 

Peter Loftus, a reporter for The Wall Street 
Journal, was quick to recognize the historic 
magnitude of Moderna’s vaccine project in 
its earliest days. He visited the company’s 
headquarters in Cambridge, Massachusetts, 
in mid-February 2020, weeks before any 
cases of community transmission had even 
been documented on US soil. He broke news 
of the shipment of Moderna’s first batch of 
clinic-bound vaccine later that month and 
stayed in touch with company executives. His 
new book, The Messenger, offers readers an 
insider’s view of the company’s ascent from 
secretive startup to one of the most valuable 
health care companies in the world. 

The protagonist of The Messenger is 
longtime Moderna chief executive Sté- 
phane Bancel. An ambitious businessman 
with an unwavering belief in his company’s 
technology, Bancel is, as Loftus writes, a 
“wizard at fundraising.” 

Before the pandemic, Bancel had amassed 
staggering sums of money on the promise 
that mRNA could be turned into powerful 
treatments for genetic diseases, cancer, heart 
failure, and more. When that idea did not 
pan out, the company pivoted to vaccines 
for infectious diseases and used data from 
a small, proof-of-concept flu vaccine trial to 
bring in more cash. With each new infusion 
of capital came hiring sprees and infrastruc- 
ture investments, along with pilot projects 
for other viruses with pandemic potential, all 
of which meant that Moderna had a vaccine 
platform ready when COVID-19 came along. 

Loftus captures Bancel’s idiosyncrasies 
with some amusing personal details. He re- 
veals, for example, how the Moderna CEO 
once gifted a Hermès belt to an academic 
who had previously teased him about the 
buckle design. The journalist’s eye is drawn 
to the many paperweights that adorn Ban- 
cel’s bookshelf—each containing a tiny vial 
of drug or vaccine encased in glass. 

The book also discusses Bancel’s brusque 
management style that rubbed many em- 
ployees the wrong way. He yelled. He de- 
meaned. “He hired people and made their 
life hell,” as one former scientist tells Lof- 
tus. However, despite noting unusually 
high rates of staff turnover and previously 
chronicled complaints of a toxic work en- 
vironment, the book does not offer many 
new fly-on-the-wall details on this topic. 

Instead, Loftus focuses on money—how 
much Moderna raised from investors, the 
fluctuations of its stock price, the compensa- 
tion packages of its executives, the billionaire 
status of its founders, the taxpayer-backed 
contracts that underwrote some of the com- 
pany’s riskiest bets. The company, we learn, 
raked in more funding faster than just about 
any other in biotech history, giving it advan- 
tages of scale and prestige. 

But Moderna did not invent the idea 
of mRNA-based medicines. It was not the 
first to win authorization for an mRNA- 
based COVID-19 vaccine. And yet, in Lof- 
tus’s telling, “It’s largely Moderna that is 
being credited with establishing the dis- 
ruptive mRNA technology.” 

“A doubted underdog driven by relent- 
less idealists,” Moderna is presented as the 
biotech David against pharma Goliaths. It 
is true that an industry giant (Pfizer) helped 
BioNTech—the German company that beat 
Moderna to the mRNA vaccine punch—get 
its COVID-19 shot over the finish line. But 
Moderna was not operating unaided—it had 
the weight of the US government behind it. 

Moderna deserves a great deal of credit 
for its decade-long slog to push mRNA 
medicines forward. But just as with the for- 
mulation of calculus in the 17th century, the 
discovery of oxygen in the 18th century, or 
the theory of evolution in the 19th century, 
mRNA vaccine technologies were in the 21st 
century ether, ready for development. 

Now, flush with profits from its COVID-19 
vaccine, Moderna needs to prove that it is 
more than a one-trick pony. All eyes are on 
the messenger to see if it will be the bearer 
of good vaccine news to come. 

10.1126/science.add1577 


Campbell recommends a primarily plant-based diet. 


PODCAST 

The Future of Nutrition: An Insider’s 
Look at the Science, Why We Keep 
Getting It Wrong, and How to Start 
Getting It Right 

T. Colin Campbell with Nelson Disla 
BenBella Books, 2020. 336 pp. 


Having spent his youth on a dairy farm, 
biochemist T. Colin Campbell was an 
advocate for the consumption of meat 
until his own research began to suggest 
that diets high in animal protein may 
increase the risk of developing certain 
diseases. This week on the Science pod- 
cast, Campbell discusses how food can 
influence health and details the benefits 
of consuming a plant-focused diet. 
https://bit.ly/3yO1cOO 
INSIGHTS


Edited by Jennifer Sills 


Funding databases for 
Ukrainian academics 


The ongoing Russian invasion jeopardizes
research in Ukraine. More than 200
Ukrainian educational institutions have 
been destroyed (1), and an estimated several
thousand academics have fled the country 
(2, 3). However, the majority of researchers— 
about 70,000 people—remain in Ukraine, 
where they are struggling to make a living 
and continue their research (2, 3). 

The international science community has 
provided generous support to Ukrainian 
researchers. Fellowships have been made 
available to displaced academics (4, 5). 
Universities have offered remote working 
opportunities (6). National funders have 
stepped up with open calls for Ukrainian 
academics (7, 8), and displaced-academics 
programs have prioritized Ukrainians (9). 

Initially, most job and funding offers 
were advertised locally on university 
or funding body websites as well as on 
Twitter. However, many offers have gone 
unnoticed. To find them, Ukrainians must 
navigate the academic systems of foreign 
countries and search multiple websites, 
many of which list offers only in the 
country’s native language. Moreover, the 
Ukrainian research community does not 
use Twitter regularly, so they are unlikely 
to see a poorly promoted tweet (3). 

Several organizations have addressed 
these challenges by creating centralized
databases that collect funding
opportunities for Ukrainian researchers.
Some databases are specific to a disci- 
pline, such as the one curated by the 
European Molecular Biology Organization 
(10). The European Commission has 
funded the Euraxess database, covering 
opportunities across Europe (11). The
#ScienceForUkraine database, managed by 
volunteers, collects worldwide opportuni- 
ties for Ukrainians (12). 

We encourage institutions and funders 
who would like to support the Ukrainian 
academic community to streamline the 
process by including their offers in estab- 
lished databases. When advertising on 
social media, institutions can make offers 
easier to find by tagging them with widely 
known hashtags such as #ERA4Ukraine and 
#ScienceForUkraine. Ukrainians trying to 
continue their research in wartime need a 
straightforward way to access support. 
Chinchilla conservation
vs. gold mining in Chile 


Since pre-Columbian times, both gold and 
chinchilla fur have been in demand (1).
From the first conquistadors to modern 
times, these resources have been severely 
exploited in the South American Andes, 
degrading the environment (2, 3) and driv- 
ing chinchillas to the brink of extinction 
(4). Now, gold mining is threatening some 
of the last wild chinchilla populations in 
Chile (5). Fortunately, Chile’s environmen- 
tal laws (6) have so far worked as designed 
to protect this Endangered species (7). 
Before the 19th century, the two spe- 
cies of chinchilla—long-tailed or coastal 
(Chinchilla lanigera) and short-tailed or 
Andean (Chinchilla chinchilla) (8)—spanned 
the coast and the Andes regions of Chile, 
Peru, Bolivia, and Argentina. As of the mid- 
20th century, chinchillas were thought to 
be extinct from their native ranges, but 
recently a few scattered colonies of short- 
tailed chinchillas were discovered in some 
of the most inhospitable and remote places 
of Chile and Bolivia (9, 10). These animals, 
likely recovering from intense human perse- 
cution over the past 200 years, need protec- 
tion from human activities to thrive. 
However, large amounts of gold have 
been discovered beneath the chinchilla 
colonies. The Gold Fields mining sites in 
northern Chile (5, 11), for example, hold
~3.5 million ounces of extractable gold. 
The mining company is ready to begin 
extraction, but Chilean law prohibits 
development projects from harming local
biodiversity (6). The mining company has 
proposed a solution of moving the chin- 
chillas, but its attempts to translocate a 
small number of chinchillas failed (12). 
Meanwhile, paid lobbyists are attempt- 
ing to ease the restrictions to allow gold 
extraction to move forward. 

Short-tailed chinchillas live in one of the 
most extreme environments known for a 
mammal: rocky outcrops, barren except 
for scattered grasses, located at elevations 
of 4000 m or higher. Challenges to popu- 
lation growth include slow reproduction 
and small, dispersed social groups (4, 10). 
Mining poses a grave threat to the fragile 
equilibrium of these populations. The 
Chilean government should continue to 
abide by the current laws, which require 
the viability of translocations to be thor- 
oughly evaluated and tested before the 
chinchilla habitat is destroyed by mining, 
both in this case and in future conflicts 
between corporate goals and environmen- 
tal conservation. Other countries, including 
Bolivia, should also use this framework 
as a model to protect vulnerable species 
within their borders. 
An octopus approaches
the authors’ camera. 


LIFE IN SCIENCE 


A scuffle at sea


Submerged 15 meters deep beneath the waters of Spain’s Atlantic Islands 
National Park, we were on a mission to identify cephalopod spawning habitats, 
information that would inform the design of a new Marine Protected Area (MPA).
We had been scuba diving in these cold waters for several days, recording data 
on octopuses, cuttlefish, and squid. Visibility was good, and we looked forward 
to what promised to be a successful dive. 

In the distance, we spotted a large octopus that was camouflaged on 
the sandy ocean floor. We carefully approached it to take some photos, but 
the octopus soon realized that it had been discovered. 
Expecting it to rise from the sand and swim away, behavior 
we had seen many times before, we maintained our 
position. However, this octopus swam straight toward us, 
at speed. 

We tried to back down, but the octopus lunged at our 
camera and ripped the diving regulator out of author 
M.E.G.’s mouth. With amusement and then growing
concern, author J.H.-U. watched as M.E.G. tried to free the 
regulator (which supplies a diver’s breathing air) from the 
octopus’s arms. But the animal refused to let go. 

J.H.-U. entered the fray, sharing air with M.E.G. and helping him wrestle with the 
octopus to retrieve the regulator, now entangled in multiple arms and held fast 
by suction. To our relief, after a few seconds, the octopus let go of the equipment. 
As it did, it released a cloud of black ink into the water to distract us and ensure 
its quick escape. Shaken, we watched as it disappeared, unharmed, into the sea. 
Our octopus encounter underscored the importance of creating MPAs to keep 
these magnificent creatures safe from illegal fishing and overfishing. Granting 
them the protection they deserve will also decrease the chances of interaction 
with humans—we would hate for a fellow diver to misinterpret their behavior and 
accuse them of being multi-armed and dangerous! 
Q&A with Mani L. Bhaumik


Bhaumik Breakthrough of the Year Award 
to recognize individual researchers 


By Andrea Korte 


Since 1996, the news and editorial staff of Science have looked 
back at the past year of scientific discovery to select the Break- 
through of the Year. In 2021, for instance, Science recognized the 
solution to a challenge that had puzzled scientists for 50 years: 
How to predict the shape that a string of amino acids will fold into 
as it becomes a working protein. In two papers published simulta- 
neously in Nature and Science in 2021, researchers showed how 
this could be done using artificial intelligence-driven software. 
Other recent Breakthroughs include 2019's 
first-ever image of a black hole and 2020's 
effective vaccines against COVID-19. 

Now, an international cash prize component,
the largest bestowed by any scientific
society, will recognize the individual scientists
whose research and activities are
foundational to the Breakthrough of the Year, 
AAAS announced in July. After the news and 
editorial staff of Science select and publish 
the Science Breakthrough of the Year, a com- 
mittee convened by the Science journals’ 
editor-in-chief will select up to three winners 
to receive the Mani L. Bhaumik Breakthrough 
of the Year Award. Starting in 2023, the con- 
vening of the selection committee will take 
place in January each year, and the announce- 
ment of the individual winner or winners will 
be made before the end of March. 

Mani L. Bhaumik, a physicist whose work on 
laser technology paved the way for Lasik eye 
surgery, shares with AAAS and the Science 
family of journals a commitment to recognizing significant develop- 
ments in scientific research. Bhaumik, whose gift is the largest in 
AAAS history, spoke with AAAS News & Notes about his inspirations 
and his current projects. 




Q: Why was it important to you to enhance the Breakthrough
of the Year with a new prize component that recognizes
individual scientists?
A: The Breakthrough of the Year is done by people; it just doesn’t fall
from the sky. People are really at the center of the Breakthrough.
I am still a practicing scientist at 91 years old, and I know that
beyond sheer curiosity about our world, peer recognition is one
of the things that drives us. To be able to recognize hard-working
scientists with this award, and perhaps prompt even more good
work, is exciting.

There are so many researchers focused on every aspect of every 
subject being considered for the Breakthrough of the Year. Choosing 
those scientists most integral to the winning research will be a big 
and challenging decision for the selection committee to make. 


Q: Tell me about your connections with AAAS and Science. 
A: I've been reading Science magazine since I started my career,
when I came to California in 1959 and was working on laser development.
When you are in such an active field, you have to run like crazy
and know the latest updates. But Science also covers such diverse 
subjects. You get a glance of what’s important across many fields. 

In 2019, I also endowed a prize at AAAS to support science communication
and recognize excellent science communicators, the AAAS
Mani L. Bhaumik Award for Public Engagement with Science. Many 
people think science is too esoteric. And even if they’re interested 
in the results, they might not understand the processes unless it is 
explained well. But it’s important for people to have that information in 
a way they can understand. Scientific knowledge is not just for scien- 
tists. It should be useful to everybody. 

Some of my recent projects are focused on reaching audiences and 
sharing science with them in a meaningful way that makes sense to 
them. Recently I created a cartoon series for children called Cosmic
Quantum Ray. I created a villain who breaks natural laws: he steals
gravity. And the good guy, Quantum Ray, comes in with his crew and 
restores the gravity. It shows children how natural phenomena like 
gravity are all around us. I’m now working on a movie for adults about 
my main research interest: quantum field theory and how it shows 
that we're all connected. There’s magic in mov- 
iemaking to get people to pay attention. 


Q: Among your projects is the Bhaumik 
Educational Foundation, which allows students 
in rural West Bengal, India, to attend college for 
free. Why is it important for you to give back? 
A: I came from extreme poverty; I didn’t
have any shoes until I was 16. The nearest
high school was about 4 miles away from my
village. So, I walked there every day and was
lucky to have good teachers and fall in love
with mathematics and science. To go to college,
I went to Kolkata and did private tutoring
to support myself and my studies. When I
went back to India for a visit in 2000, I read
a newspaper article that among the top high
school graduates, at least half of them have
no resources to fulfill their dreams. I thought,
this is the best way I can pay off my dues from
the people who helped me. I started the foundation
to give back to my country of origin,
and it’s put about 300 students so far through university studying 
the science, engineering, math, and medicine that India needs. 


Q: What were the breakthroughs and who were the researchers
who inspired you during your own research career?
A: Satyendra Nath Bose was my mentor and teacher. He got me
interested in theoretical physics. And Paul Dirac came to visit him;
Dirac is known as the father of the quantum field theory. He started
talking about that, and to me it sounded unbelievable, that whether
matter or force, they all come from similar types of fields. Meeting
Dirac was a dramatic experience. His talk was something I never had
thought about: All electrons are exactly the same throughout the
universe, and there is a common origin.


Q: What wisdom or advice would you give to the future winners
of the Breakthrough of the Year?

A: My humble advice to the potential winners of the award would 
be for them to realize that science is a very hard taskmaster and 
demands dedication with unremitting strenuous work for success. 
Fortunately, the ecstasy of revealing the secrets of nature makes 
the arduous work feel like fun. 


This interview has been condensed and edited for clarity. 
Will marshes rise up or sink?


Marsh ecosystems are vulnerable to rising sea level, in addition to land-use change and
other human activities. Studies have shown that some marshes are gaining elevation, 
making them remarkably resilient to rising seas; however, results vary across locations and 
between contemporary and Holocene records. Comparing data from 97 sites on four con- 
tinents, Saintilan et al. found that the relationship between sediment accretion and marsh 
subsidence explains the variable responses to sea-level rise. Marshes accrete more sediment, 
keeping up with sea-level rise up to a point, but sediment subsidence increases nonlinearly with 
accretion such that at higher rates of sea-level rise, marshes begin to sink. Marshes are unlikely to 
keep up with rising seas under current climate change projections. —BEL 
Science, abo7872, this issue p. 523 


Tidal marshes, such as those found in the Wadden Sea in the Netherlands, will likely not 
accumulate sediment fast enough to keep up with rising waters. 




Speed limit for 
solid-state nanoionics 


The speed of biological infor- 
mation processing in neurons 
and synapses is limited by 
the aqueous medium through 
which weak action potentials of 
about 100 millivolts propagate 
over milliseconds. Above 1.23 
volts, liquid water decomposes. 
Artificial solid-state neurons are 
not limited by such time and 
voltage constraints and can also 
be fabricated at the nanoscale, 
1000 times smaller than their 
biological counterparts. Using 
complementary metal-oxide
semiconductor-compatible
materials, Onen et al. prototyped 
nanoscale protonic programma- 
ble resistors that can withstand 
high electric fields of around 
10 megavolts per centimeter 
and which have energy-efficient 
modulation characteristics at 
room temperature. The pro- 
posed devices are 10,000 times 
faster than biological synapses 
and offer a promising direction 
for implementing various appli- 
cations that can benefit from 
fast ionic motion. —YS 

Science, abp8064, this issue p. 539. 


Targeting drug-resistant 
brain cancer 


A glioma is a tumor of the
glial cells found in the brain
and spinal cord. Glioblastoma
multiforme (GBM) is the most 
common type of glioma, and is 
a highly aggressive brain tumor 
in dire need of new treatment 
strategies. GBM is frequently 
treated with the chemotherapy 
drug temozolomide (TMZ), but 
resistance develops in more than 
half of patients. Using a medical 
chemistry approach, Lin et al. 
designed TMZ analogs to over- 
come drug resistance in GBM 
(see the Perspective by Reddel 
and Aref). These agents generate 
a primary DNA lesion that can 
be repaired by healthy cells with 
intact DNA repair mechanisms. 
However, cancer cells that lack 
DNA-repair machinery are not 
able to repair the damage and 
slowly evolve to harbor more
toxic secondary lesions that 
cause selective tumor cell killing. 
—PNK 
Science, abn7570, this issue p. 502;
see also add4839, p. 467


Close-up view of 
transport train assembly 


Cilia and flagella extend from 
the cell surface of various 
eukaryotic cells and perform 
diverse motility and signal- 
ing functions. The base of the 
cilium controls the entry of 
large intraflagellar transport 
trains that carry important 
cargo proteins throughout this 
specialized organelle. Defects 
in the ciliary base result in 
altered ciliary composition 
and human diseases. van den 
Hoek et al. used a combination 
of cryo-electron tomography
and expansion microscopy 
techniques to study the 
molecular architecture of the 
ciliary base in the green alga 
Chlamydomonas reinhardtii. 
Their findings elucidate how 
intraflagellar transport trains 
assemble before they enter cilia 
and demonstrate the possibility 
of visualizing dynamic events 
with molecular resolution inside 
native cells. —SMH

Science, abm6704, this issue p. 543


Life’s bumpy road 
Throughout life, cells of the body 
acquire somatic mutations. Bae 
et al. took a close look at the 
mutational load on the human 
brain. A survey of genomic 
alterations from normal brain 
samples and samples from 
brains affected with Tourette 
syndrome, schizophrenia, or 
autism spectrum disorder 
showed that although most 
human brains do acquire these 
somatic mutations, a subset 
seem to be hypermutable. In the 
samples of brains from people 
with autism spectrum disorder, 
some of these mutations created 
greater opportunities for tran- 
scription factor binding. —PJH 
Science, abm6222, this issue p. 511 
Correlative straining
mapping 


The ability to simultaneously 
track elastic, plastic, and total 
strains with high resolution 
during the deformation of 
engineering materials is vital to 
understanding the micromecha- 
nisms of materials mechanics. 
Studying single-crystal tung- 
sten under mechanical load, 
Edwards et al. demonstrated 
the feasibility of measuring pure 
elastic and plastic components 
of deformation with submicro- 
meter resolution by combining 
high-resolution electron back- 
scatter diffraction and nanoscale 
digital image correlation. These 
measurements shed new light 
on toughness-inducing plasticity 
at a crack tip in tungsten. This 
approach can be used to study 
the elastic and plastic processes 
relevant to microstructure in 
various types of materials. —ML 
Sci. Adv. 10.1126/sciadv.abo5735 (2022).


Squalene incites acne 
Lipids from the skin's seba- 
ceous glands contribute to 
the pathogenesis of acne, but 
the molecular circuits allow- 
ing lipids and skin bacteria to 
collaborate and incite lesion 
formation are unclear. Do et 
al. used single-cell analysis 
technologies, including spatial 
transcriptomics, to show that 
many macrophages in acne 
lesions belong to the TREM2 
(triggering receptor expressed 
on myeloid cells 2) subset and 
are found adjacent to hair follicles
and sebaceous glands.
The addition of squalene to 
monocyte-derived macro- 
phages in culture promoted 
their differentiation into TREM2 
macrophages that exhibited 
reduced killing of Cutibacterium 
acnes bacteria. These findings 
identify disease-associated pro- 
inflammatory pathways that can 
be therapeutically targeted as 
part of an effort to better control 
the appearance of new lesions in 
patients with chronic acne. —IRW 
Sci. Immunol. 7, eabo2787 (2022).
Individuality in avian predators


In an ergodic system, the average behavior of
an individual over time resembles the average 
behavior of many individuals at a single time. 
The ergodicity of a system, or the lack thereof, 
is important for understanding any large-scale 
phenomenon such as the stock market or the 
migratory patterns of species. Vilk et al. tracked a 
total of 70 birds, all avian predators, including barn 
owls, black-winged kites, and common kestrels, 
and found that they exhibited ergodic behaviors 
when cruising over long distances but nonergodic 
behaviors when foraging over short distances. 
The authors believe that the nonergodic hunting 
preferences of the individual birds are an evolutionary
adaptation to how their prey vary their survival
strategies. —YY
Phys. Rev. X 12, 031005 (2022).


The hunting preferences of the barn owl (shown here)
and other birds are an evolutionary adaptation to how
their prey vary their survival strategies.


Reusing solvent in 
nanoparticle synthesis 


Colloidal synthesis of metal 
nanoparticles can create tighter 
size distributions and poten- 
tially greater control of reactivity 
in heterogeneous catalysis than 
conventional methods such as 
wet impregnation. However, 
colloidal synthesis is a costly 
alternative because solvents 
and surfactants are normally 
used only once. Wrasman et al. 
added an antisolvent mixture 
of 2-propanol and ethanol to 
precipitate the nanoparticles 
after colloidal synthesis and 
then recovered the hydro- 
carbon solvent and unused 
surfactant through distillation. 
Synthesis of metal nanoparticles
such as palladium and silver,
and alloys such as platinum-tin,
could be repeated up to
10 times with the recovered
materials while maintaining
the initial size distributions.
These nanoparticles delivered 
highly reproducible results 
in reactions such as propane 
dehydrogenation and may offer 
a cost-competitive alternative 
as catalysts. —PDS 

J. Am. Chem. Soc. 144, 11646 (2022).


Legacies of drought 
Drought affects aboveground 
plants, as well as their below- 
ground associated microbial and 
fungal communities, to influence 
plant growth. Lozano et al. show 
that soil microbial communities 
carry the effects of previous 
drought even into water-replete 
situations. A variety of grassland 
species, including graminoids, 
forbs, and legumes, were grown 
in soil that had previously hosted 
plants in drought or well-watered 
conditions. Root morphology 
showed changes characteristic of 
water scarcity when plants were 
grown in soil that had previously
experienced drought even though 
the plants themselves had not. 
Soil that has experienced drought 
may be deficient in mycor- 
rhizal fungal mutualists. As a 
result, plants growing in drought 
soil may grow thicker roots to 
encourage the establishment of 
mutualistic associations from 
depleted soil. —PJH 

New Phytol. 10.1111/nph.18327 (2022).


Managing lake 
water quality 


Large bodies of freshwater are 
under increasing threat from 
climate change and human 
activities. The clear waters of 
Lake Geneva risk eutrophication 
and declining deepwater oxygen 
because of organic matter and 
phosphorus inputs that promote 
algal growth. Deyle et al. applied 
machine learning to time-series
data for Lake Geneva to gain
a more precise understanding
of the variables affecting water 
quality. There have been efforts 
to reduce phosphorus inputs, 
but success has been limited, in 
part because increasing ambient 
temperature also promotes algal 
growth. Interdependencies among 
biogeochemical variables were 
included in a machine-learning 
approach coupled with conven- 
tional models for predicting deep 
lake dissolved oxygen levels. The
combined approach offers more 
nuanced guidance for lake water 
quality management. —CA 

Proc. Natl. Acad. Sci. U.S.A. 119, e21024661 (2022).


Women choosing 
greener pastures 


Women faculty remain underrepresented
in chemistry
because of a lack of applicants.
Howe et al. surveyed
130 women and found that 
themes of overwork, unrealistic 
expectations, and an emphasis 
on research over teaching and 
mentoring were the underlying 
reasons that women looked 
elsewhere for employment. 
Further analysis revealed 
that women felt unsupported 
during graduate school and 
extrapolated this lack of sup- 
port to faculty positions. This 
suggests that graduate-level 
experiences inform interests 
and career goals, and that 
systemic changes for increasing 
the diversity of faculty need to 
be made at the graduate level. 
Collectively, the results show 
that diversifying faculty posi- 
tions is much more than simply 
opening a position. Instead, it 
requires policy changes that 
explicitly value women and their 
ideals. —MMc

JACS Au. 2, 1443 (2022). 
Exercise boosts
tumor immunity 


Exercise has many positive 
effects on the body, including 
improved blood circulation, 
heart strengthening, and 
weight control. Kurz et al. 
report that aerobic exercise is 
also associated with reduced 
tumor growth in models of 
pancreatic ductal adenocar- 
cinoma. They showed that 
exercise enhanced the survival 
of a type of immune cell called 
CD8+ T lymphocytes, which
are known for their ability to
kill tumors. Greater numbers
of CD8+ T cells capable of
infiltrating tumors were found 
to accumulate as a result of 
an exercise regimen. The anti- 
tumor effects were mediated 
through the interleukin-15 (IL- 
15) axis, and pharmacological 
targeting of the IL-15/IL-15Rα
pathway slowed tumor growth, 
particularly when combined 
with immunotherapy and che- 
motherapy treatments. —PNK 
Cancer Cell 40, 720 (2022). 


Gender inequality 
Do inequities in women's 
editorships associate with 
an underrepresentation of 
women authors? Last et al. 
examined the association 
between women’s editorship 
and women's authorship 
across randomly selected 
journals in infectious dis- 
eases spanning each impact 
factor quartile. Only 33% of 
infectious disease journals’ 
editors-in-chief were found 
to be women. While con- 
trolling for journal impact 
factor, analysis of over 10,000 
citable infectious diseases 
articles found that although 
nearly 50% of first authors 
were women, only 35% of 
last authors were women. 
Moreover, the presence of 
more women editors was 
associated with more women 
first authors, as well as more 
women last authors. —EEU 
Lancet Infect. Dis. 10.1016/S1473-3099(22)00367-X (2022).
INNATE IMMUNITY
A common core of
immune responses 


Dealing with pathogens is a 
constant struggle for many life- 
forms, and innate and adaptive 
immune systems are needed to 
support survival. Essuman et 
al. reviewed the latest insights 
into the Toll/interleukin-1 
resistance/receptor (TIR) 
domain proteins, which support 
immune responses across the 
tree of life, from archaea to 
bacteria to plants to humans. 
TIR domains function in innate 
immune signaling pathways, as 
well as in axon degeneration in 
animals. Some, but not all, TIR 
domains have enzymatic activ- 
ity. TIR nicotinamide adenine 
dinucleotide hydrolase activity 
starves phages as they infect 
prokaryotes and promotes 
hypersensitive cell death in a 
plant's response to pathogens. 
Plant TIR domains can also syn- 
thesize small, nucleotide-based 
second messengers that initiate 
an immune response. —PJH 
Science, abo0001, this issue p. 486


SOLAR CELLS 


A radical doping approach
In perovskite solar cells, high 
power conversion efficiencies 
(PCEs) are usually obtained with 
an organic hole transporter called 
spiro-OMeTAD. This material 
must be doped to have sufficient 
conductivity and optimal work 
function, but the conventional 
process with lithium organic salts 
requires a long oxidation step 
that also affects device stability. 
Zhang et al. added spiro-OMeTAD 
biradical precursors that convert 
into stable organic monoradicals. 
Combined with ionic salts, this 
doping strategy formed solar 
cells with high PCEs (>25%) 
and improved stability. This 
approach also allows conduc- 
tivity and work function to be 
tuned separately and could be 
applied in other optoelectronic 
devices. —PDS 

Science, abo2757, this issue p. 495.
BIOPHYSICS
A new force in
chromatin research 


Fundamental questions about 
the physical nature of chromo- 
somes remain unanswered, 
largely due to the absence of 
direct mechanical measure- 
ments inside the nuclei of living 
cells. Keizer et al. developed 
a technique to measure how 
a genomic locus, in its native 
nuclear context, responds to 
a point force of physiological 
magnitude (see the Perspective 
by So and Tanner). Interphase 
chromatin was found to be liquid-like,
with moderate hindrance
and topological effects, contrast- 
ing with the common view of a 
crowded and entangled nuclear 
environment. Ultimately, these 
measurements allow a deeper 
understanding of how genomic 
elements can move under active 
biological forces and provide a 
basis for developing new physi- 
cal models of chromosomes. 
—DJ 

Science, abi9810, this issue p. 489;
see also add5444, p. 472


BIOIMAGING 
Continuous long-term 
ultrasound imaging 


Ultrasound is widely used for the 
noninvasive imaging of tissues 
and organs, but this method 
requires close contact between 
the transducer and the target 
area. This can make it difficult 
to acquire images over a long 
period of time, especially if the 
patient needs to be mobile. 
Wang et al. describe a wear- 
able ultrasound imaging device 
(see the Perspective by Tan and 
Lu). A rigid piezoelectric probe 
array is bonded to the skin
with an acoustically transparent
hydrogel elastomer. In vivo
testing showed that the device
could be comfortably worn for
48 hours, and hooking the array
up to a commercially available 
ultrasound platform allowed for 
continuous ultrasound images 
of the carotid artery, lung, and
abdomen. —MSL 
Science, abo2542, this issue p. 517; 
see also adc8732, p. 466 


SPECIES INTERACTIONS 
“Pollination” of red algae 


Pollination by animals is a wide- 
spread and important symbiotic 
relationship that increases 
plants’ genetic diversity and 
fertilization success. Lavaut et 
al. discovered that this relation- 
ship is not unique to plants; it 
is also found in the dioecious 
red alga Gracilaria gracilis (see 
the Perspective by Ollerton and 
Ren). The isopod Idotea balthica
carries the alga’s spermatia 
on its body, and experiments 
showed that isopods moving 
from a male to a female alga 
increased the alga’s fertilization. 
Transport of algal gametes by 
isopods could be an important 
mechanism for reproduction 
because the spermatia lack 
flagella and otherwise rely on 
water currents to transport 
them. —BEL 

Science, abo6661, this issue p. 528;
see also add3198, p. 471


MOLECULAR BIOLOGY 


Recording events in cells 
CRISPR-mediated DNA editing 
has turned toward recording 
events in cells, allowing cell 
differentiation lineages to be 
determined in vivo and poten- 
tially enabling gene expression 
and cell signaling to be recorded. 
These CRISPR information 
recorders rely on mutations 
being introduced to synthetic 
“DNA tapes” that are expressed 
in cells and can be sequenced 
after tissue development and 
growth to reconstruct changes 
over time. In a Perspective, 
Masuyama et al. discuss the 
potential of this approach 

to better understand the 
development and pathogenic 
progression of tissues such 

as tumors. Technological 
challenges currently limit the 
application of these systems, 


but improvements are being 
developed rapidly. —GKA 
Science, abo3471, this issue p. 469 


CARBON NANOTUBES 
Ordering surface 
modifications 


Although several methods exist 
for functionalizing the surface 
of carbon nanotubes, the sites 
modified tend to be random. 
However, ordered functionaliza- 
tion of single-walled carbon 
nanotubes (SWCNTs) could 
lead to quantum properties. 
Lin et al. screened short DNA 
sequences containing cyto- 
sine (C) and guanine (G) to 
control the placement of G for 
photo-cross-linking reactions
(see the Perspective by Wang). 
Spectroscopy and cryo-electron 
microscopy showed that for the 
(8,3)-SWCNT enantiomer, the 
sequences C₃GC₇GC₃ regularly
created an ordered helical 
structure with a 6.5-angstrom 
periodicity in which G bridges 
two carbon atoms. —PDS 
Science, abo4628, this issue p. 535, 
see also abq2580, p. 473


PARKINSON’S DISEASE 
Microglia modulates 
PD risk 


Previous studies have identified 
variants in the LRRK2 gene that 
are associated with increased 
risk of developing Parkinson's 
disease (PD). Langston et al. 
studied the mechanisms link- 
ing LRRK2 variants to PD and 
showed that the disease risk 

is determined by microglial 
cells. Studying PD-associated 
LRRK2 variants in patient-
derived tissue and induced 
pluripotent stem cell-derived 
microglia, the authors identified 
a quantitative trait in microglia, 
suggesting that although this 
cell type expresses LRRK2 in 
lower amounts compared with 
other cell types, it might be 
critical for determining PD risk. 
The lack of correlation between 
cell-type-specific baseline
expression and effect on dis- 
ease pathophysiology should be 
considered when evaluating the 
mechanisms mediating this and 
other disease-associated vari- 
ants in noncoding gene regions. 
—MM 

Sci. Transl. Med. 14, eabp8869 (2022). 


PHYSIOLOGY 
A painful breakdown 
of cartilage 


The proinflammatory cytokine 
interleukin-6 (IL-6) is associ- 
ated with inflammatory pain 
and structural changes in joints 
that characterize osteoarthritis. 
Using a mouse model of injury- 
induced osteoarthritis, Liao 
et al. demonstrated that IL-6 
promoted both disease symp- 
toms in a sex-specific manner 
(see the Focus by Pacifici). In 
male mice, but not in female 
mice, IL-6 promoted articular 
cartilage degeneration and pain 
signaling through distinct sig- 
naling pathways. These findings 
have implications for potential 
sex-specific osteoarthritis 
therapies. —AMV

Sci. Signal. 15, eabn7082, 

eadd3702 (2022). 


PLANT SCIENCE 
Nucleotide-based 
signals protect 


Key receptors in flowering plants 
respond to pathogen effector 
molecules and trigger the plant's 
immune response when needed. 
Huang et al. and Jia et al. now 
tell the story of a branching 
signaling pathway through which 
receptors with an N-terminal 
Toll/interleukin-1 receptor (TIR) 
domain enzymatically pro- 
duce small, nucleotide-based 
second messengers and initiate 
immune responses in the small 
mustard plant Arabidopsis. The 
pathways diverge to activate 
different immune responses, 
with one pathway controlling 
the cell death response and the 
other managing transcriptional 
change. —PJH 

Science, abq3297, abq8180, 

this issue p. 487, 488 
SOLAR CELLS
Managing excess 
lead iodide 


In hybrid perovskite solar cells, 
the formation of lead iodide 
(PbI₂) can provide some pas-
sivation effects but can lead to 
device instability and hysteresis 
in current-density changes with 
voltage. Zhao et al. show that 
doping with rubidium chloride 
(RbCl) can create a passive 
inactive (PbI₂)₂RbCl phase that
stabilizes the perovskite phase 
and lowers its bandgap. Devices 
exhibited 25.6% certified power 
efficiency and maintained 80% 
of that efficiency after 500 hours 
of operation at 85°C. —PDS 
Science, abp8873, this issue p. 531 
INNATE IMMUNITY


Shared TIR enzymatic functions regulate 
cell death and immunity across the tree of life 


Kow Essuman, Jeffrey Milbrandt, Jeffery L. Dangl, Marc T. Nishimura* 


BACKGROUND: Diverse organisms, from archaea 
and bacteria to plants and humans, use receptor 
systems to recognize both pathogens and 
dangerous self-derived or environmentally 
derived stimuli. These intricate, well-coordinated 
immune systems, composed of innate and 
adaptive components, ensure host survival. In 
the late 20th century, researchers identified 
the Toll/interleukin-1/resistance gene (TIR) 
domain as an evolutionarily conserved compo- 
nent of animal and plant innate immune sys- 
tems. Today, TIR-domain proteins are known 
to be broadly distributed across the tree of 
life. The TIR domain was first recognized as an 
adaptor for the assembly of macromolecular 
signaling complexes in mammalian innate im- 
mune pathways. Work on axon degeneration 
in animals—as well as on plant, archaeal, and 
bacterial immune systems—has uncovered ad- 
ditional enzymatic activities for TIR domains. 


ADVANCES: Mammalian axons initiate a self- 
destruct program upon injury and during 
disease that is mediated by the sterile alpha 
and TIR motif containing 1 (SARM1) protein. 
The SARM1 TIR domain enzymatically con-
sumes the essential metabolic cofactor nic-
otinamide adenine dinucleotide (NAD⁺) to
promote axonal death. Identification of the
SARM1 NAD⁺-consuming enzyme (NADase)
revealed that TIR domains can function as 
enzymes. Given the evolutionary conserva- 
tion of TIR domains, studies investigated 
whether the SARM1 TIR NADase was also 
conserved. Indeed, bacteria, archaea, and 
plant TIR domains possess NADase activ- 
ity. In prokaryotes, TIR NADase activity is 
found in an ancient antiphage immune sys- 
tem. In plants, identification of TIR NADase 
activity and linkage of TIR enzymatic pro- 
ducts to downstream signaling components 
addressed the question of how nucleotide- 
binding, leucine-rich repeat (NLR) recep- 
tors trigger hypersensitive cell death during 
an immune response. Studies in plants show 
that their TIR domains can cleave nucle-
ic acids and possess 2',3' cyclic adenosine
monophosphate (2',3'-cAMP) and 2',3' cy-
clic guanosine monophosphate (2',3'-cGMP)
synthetase activity that aids cell death pro- 
grams in plant innate immunity. Thus, TIR 
domains constitute an ancient family of 


enzymes that are activated in immune and 
cell death pathways. 


OUTLOOK: The discovery of TIR-domain en- 
zyme activities carries implications for innate 
immunity and neurodegeneration. The identi- 
fication of the SARM1 NADase defined a drug 
target for a wide number of neurodegenerative 
diseases that is being exploited in both preclin- 
ical and clinical studies. Hyperactive mutations 
in the SARM1 NADase have been discovered 
in amyotrophic lateral sclerosis (ALS) patients. 
Future work will seek to clarify the contribu- 
tion of the SARM1 axon degeneration pathway 
to ALS pathogenesis. NAD⁺ biology influences
cellular processes from metabolism to DNA
repair to aging. How TIR enzymes influence
the NAD⁺ metabolome and its associated path-
ways in bacteria, archaea, plants, and animals 
will be an exciting area for upcoming investi- 
gation. The discovery of the diversity of TIR 
enzymatic products is revealing signaling 
pathways across kingdoms. Discovery of TIR 
enzymatic function in plants and animals may 
yet inspire studies of enzymatic functions for 
Toll-like receptors in animals. We anticipate 
that cross-kingdom studies of TIR-domain 
function will guide interventions that will span 
the tree of life, from treating human neuro- 
degenerative disorders and bacterial infections 
to preventing plant diseases. 


The list of author affiliations is available in the full article online. 
Conserved TIR-domain enzymatic activity. TIR-domain proteins from prokaryotes
and eukaryotes cleave NAD⁺ into nicotinamide (Nam), ADP-ribose (ADPR), cyclic
ADP-ribose (cADPR), isomers of cyclic ADP-ribose (2' or 3'cADPR), and related
molecules [e.g., phosphoribosyl adenosine monophosphate (pRib-AMP)]. Plant TIR
domains also possess a nuclease activity, can degrade DNA and RNA, and can
function as a 2',3'-cAMP or 2',3'-cGMP synthetase. TIR enzymatic activity drives cell
death and immune pathways across kingdoms. TIR activity can kill cells directly
through NAD⁺ depletion or indirectly using enzymatic products as signal molecules.
The representative TIR domain structure shown here is Protein Data Bank ID 6O0Q.
EDS1, enhanced disease susceptibility 1; ThsA, Thoeris A.
Shared TIR enzymatic functions regulate
cell death and immunity across the tree of life


Kow Essuman, Jeffrey Milbrandt, Jeffery L. Dangl, Marc T. Nishimura*


In the 20th century, researchers studying animal and plant signaling pathways discovered a protein 
domain that is shared across diverse innate immune systems: the Toll/interleukin-1/resistance gene 
(TIR) domain. The TIR domain is found in several protein architectures and was defined as an adaptor 
that mediates protein-protein interactions in animal innate immunity and developmental signaling 
pathways. However, studies of nerve degeneration in animals—and subsequent breakthroughs in 
plant, bacterial, and archaeal systems—revealed that TIR domains possess enzymatic activities. 

We provide a synthesis of TIR functions and the role of various related TIR enzymatic products in 
evolutionarily diverse immune systems. These studies may ultimately guide interventions that would 
span the tree of life, from treating human neurodegenerative disorders and bacterial infections to
preventing plant diseases.


Innate immune systems of living organ-
isms comprise a set of molecular, cellular,
and physiological responses that are rap-
idly mobilized in response to pathogens
and harmful stimuli derived from dam-
aged host cells (1, 2). Studies of animal and
plant innate immune responses identified the 
evolutionarily conserved Toll/interleukin-1/ 
resistance gene (TIR) domain as a common 
element in proteins involved in innate immu- 
nity (Fig. 1) (3). In animal innate immune sys- 
tems, TIR domains provide the signaling motif 
of Toll-like receptors (TLRs) and interleukin-1 
receptors (IL-1Rs) (4). These TIR domains 
function as scaffolds to assemble signaling 
complexes and transduce defense responses 
that include activation of nuclear factor κB
(NF-κB) signaling and interferon production
(4). Activation of mammalian TIR protein 
complexes also stimulates aerobic glycolysis 
in functional immune cells (5, 6). In plants, 
TIR domains are typically found at the amino 
terminus of nucleotide-binding leucine-rich 
repeat receptor (NLR) proteins (Fig. 1) (7, 8). 
Canonical plant NLRs possess a nucleotide 
binding site (NBS), leucine-rich repeat (LRR) 
domain, and a TIR domain or coiled-coil (CC) 
domain at their N terminus, giving rise to TIR-
NLRs (TNLs) and CC-NLRs (CNLs), respec- 
tively. NLRs are the key intracellular receptors 
in the plant innate immune system (7, 8) and 
are structurally similar to animal NLR pro- 
teins that signal by oligomerizing into inflam- 
masomes (7). In response to pathogenic and 
endogenous danger signals, plant NLRs trig-
ger a rapid immune response that initiates a
transcriptional reprogramming that halts 
pathogen growth and often culminates in 
localized cell death (9). TIR domains are also 
encoded (more rarely) in some fungal and pro- 
tozoal genomes where their function remains 
to be elucidated (10, 11). More recently, TIR
domain proteins were discovered to be widely 
found in bacteria and archaea (Fig. 1) (12-17). 
Initial characterization of these bacterial pro- 
teins indicated that they function as virulence 
factors to interrupt TLR signaling in the mam- 
malian host, but recent work also defines intra- 
cellular signaling roles in bacterial antiphage 
defense (12, 13, 16, 18). Thus, TIR domain im- 
mune function is found across kingdoms in 
diverse systems. 

The original molecular function of TIR 
domains as signaling scaffolds is well doc- 
umented for innate immune responses in 
animals (4, 19-21). Recent work, however, 
demonstrated that TIR proteins are also an 
ancient enzyme family across all domains of 
life (15, 18, 22-26). TIR domains can act as 
nicotinamide adenine dinucleotide (NAD⁺)
hydrolases (15, 22-26). Additionally, at least 
some plant TIR domains are bifunctional 
enzymes that are also able to hydrolyze both 
NAD⁺ and DNA and RNA, thus also acting as
2',3' cyclic adenosine monophosphate (2’,3'-cAMP) 
and 2’,3' cyclic guanosine monophosphate 
(2',3'-cGMP) synthetases (23, 24, 27). In this re- 
view, we discuss the biology of TIR domains 
with an emphasis on enzymatic functions
in immune and cell death pathways (sum- 
marized in Fig. 1B). 


Discovery of the TIR domain 


The identification of the TIR domain began 
with the characterization of three key genes 
that encode the mammalian IL-1R, Drosophila 
Toll, and the tobacco N protein, which con- 
ditions resistance to tobacco mosaic virus (19). 
In the late 1980s, IL-1R was cloned (28). A few 
years later, its cytosolic domain was found 
to be homologous to the cytosolic domain of 
Drosophila Toll (29, 30). Toll, at that time, 
was implicated in the dorsoventral pattern- 
ing developmental process of Drosophila 
that involved signaling by the NF-κB factor
Dorsal (31, 32). IL-1 was also shown to ac-
tivate NF-κB (29, 33). In 1994, researchers
studying the plant immune system discovered 
that the gene N encodes an amino-terminal 
domain with substantial homology to the 
carboxyl-terminal domain of Toll and IL-1R
(34). Because of its N-terminal TIR domain,
N is a TNL receptor. Its activation by recog-
nition of a protein from the tobacco mosaic 
virus leads to hypersensitive programmed 
cell death and restriction of viral replication 
(34). Functions for Toll were later extended 
beyond development to immune activation 
during bacterial and fungal infections (35-37). 
These discoveries led to the identification 
of several mammalian TLR proteins har- 
boring TIR domains and a large superfam- 
ily of intracellular plant TNL receptors that 
are widely distributed across dicotyledon- 
ous plant genomes (4, 7). Single-domain 
plant TIR proteins and “TIR-X” proteins 
(where X is a domain of unknown func- 
tion) are also common and functional (Fig. 1) 
(38-40). 


TIR domains in animal immune signaling 


The realization that TIR domains function 
as scaffolds emerged from extensive work in 
animal innate immune signaling (9). In ani- 
mals, TIR domains are found in Toll and 
TLRs, their cytosolic adaptor proteins, and the 
IL-1R family (4, 19) (Fig. 1). TLR receptors are
pattern recognition receptors, which detect 
pathogen- or microbe-associated molecular 
patterns (PAMPs and MAMPs) and activate 
downstream signaling pathways (4). The de- 
tection of PAMPs or MAMPs occurs through 
the N-terminal domain of TLR dimers at the 
cell surface or within endosomal membranes 
(4). This triggers TLR C-terminal TIR do- 
main dimerization, recruitment of cytosolic 
TIR adaptors (4, 21, 41), and TIR-mediated 
cooperative assembly of a supramolecular 
organizing complex (19-21, 42, 43). Complex 
formation leads to signaling through activa- 
tion of downstream protein kinases and tran- 
scription factors, and then to host inflammatory 
Fig. 1. Diverse TIR domain proteins across the tree of life share related
enzymatic functions to regulate cell death and immunity. (A) TIR domains
(labeled T, green circle) are present in diverse domain configurations from animals,
plants, and prokaryotes. Many domain configurations are not shown. Plant genomes
encode TIR domain proteins with diverse domain architectures. Bacteria and
archaea also possess TIR domain proteins with diverse domain architectures,
including TIR domains located at either the N or C termini. Other domains present
in prokaryotic TIR proteins that are represented by "Y-Z" in the figure include CC
domains, STING, WD40 repeats, and the Mpr1p Pad1p N-terminal domain. DD, death
domain; ID, intermediate domain; MyD88, myeloid differentiation primary response
protein 88; TPR, tetratricopeptide repeat. (B) Across the tree of life, TIR domains
function as enzymes to process nucleotide-containing molecules, either to generate
signals or to deplete them from the cell. TIR products or NAD⁺ depletion are often
associated with beneficial cell death or immune activation. 


and defense responses (4, 42, 44). TIR domains 
in the IL-1R family share similar intracellular
signaling mechanisms as TLRs and influence 
myriad innate and adaptive immune responses 
(4, 45-47). 

Although animal TIR adaptor proteins gen- 
erally promote TLR signaling, the discovery 
that the TIR “adaptor” protein, sterile alpha 
and TIR motif containing 1 (SARM1), inhib- 
its TLR signaling indicated that it might 
have distinct functions (48). Indeed, the 
Caenorhabditis elegans SARM1 ortholog 
(called TIR-1) has essential roles in both 
immunity and development (49-51). TIR-1 is
required for resistance to certain fungal and 
bacterial infections (49) and specification 
of asymmetric odorant receptor expression 
during neuronal development (50). Thus, 
C. elegans TIR-1 function is reminiscent of 
Toll function in dorsoventral patterning in de- 
velopment and innate immunity (31, 35-37). 
Another distinguishing feature of SARM1 
among TIR adaptors is its enzymatic role in 
driving axon degeneration (22, 52, 53) (Fig. 2), 
which laid the groundwork for the discovery 
of TIR domain enzymatic function across
kingdoms. 


TIR domains function in neuronal cell death 


In the mid-19th century, neurophysiologist 
Augustus Waller described the degeneration 
of injured frog hypoglossal and glossopha- 
ryngeal nerves (54). Wallerian degeneration 
initially was thought to be a passive wasting 
of the distal segment of a damaged axon, but 
this view was challenged by the serendipitous 
discovery of a naturally occurring spontaneous 
mutant mouse, the Wallerian degeneration 
slow (WldS) mutant (55). In WldS mice, tran-
sected distal axons remain intact for several
days after injury and continue to conduct ac-
tion potentials and remain metabolically active
(55, 56). The axonal protection observed in
the WldS mice is caused by a gain-of-function
mutation that results in a fusion protein (WldS)
composed of portions of the ubiquitination
factor UBE4B and nicotinamide mononucleo-
tide adenylyltransferase 1 (NMNAT1) (56, 57).
The enzymatic activity of NMNAT1, which
converts nicotinamide mononucleotide (NMN)
to NAD⁺ (Fig. 3), is required for WldS to protect
axons (58). Unraveling the mechanism of this 
hypermorphic mutation revealed an active, 
genetically encoded death program in axons 
after injury (53, 59-63). 

Genetic screens in Drosophila and in mouse 
dorsal root ganglia neurons identified SARM1 
as a required component of a conserved axon 
death program (61, 62). Subsequent experiments
using SARM1 knockout mice showed robust
axonal protection comparable to that observed
in WldS mice (56, 61, 62). Transected axons from
Drosophila SARM1 mutants can remain intact
for at least 6 weeks, a duration that approximates
the life span of the fly (61). Loss of SARM1 delays
axon degeneration and improves functional out- 
comes in several neurodegenerative disease 
mouse models, including chemotherapeutic 
and diabetic peripheral neuropathy, traumatic 
brain injury, glaucoma, retinopathy, and amyo- 
trophic lateral sclerosis (ALS) (64-70). Loss of 
SARM1 also blocks axon degeneration in hu- 
man induced pluripotent stem cell-derived 
sensory neurons (71). In the absence of in-
jury, SARM1 knockout mice display no overt 
Fig. 2. Working model of the Wallerian axon
degeneration pathway. Activation of the Wallerian 
axon degeneration pathway during injury involves 
the reduction of axon protective factors, primarily 
nicotinamide mononucleotide adenylyltransferase 2 
(NMNAT2). The level of NMNAT2 is regulated by 
multiple mitogen-activated protein (MAP) kinases, 
PAM-Highwire-Rpm-1 (PHR1), F Box protein 45 
(FBXO45), and S phase kinase associated protein
1 (SKP1). Low levels of NMNAT2 lead to an increase
in NMN and a decrease in NAD⁺. Both NMN and
NAD⁺ bind to SARM1, where a high NMN:NAD⁺ ratio
is responsible for potent SARM1 NADase enzyme
activation. Activated SARM1 NADase cleaves NAD⁺
into Nam, ADPR, and cADPR. SARM1-mediated
NAD⁺ cleavage leads to energetic failure through
axonal ATP loss and subsequent mitochondrial 
depolarization, calcium influx, and, ultimately, 

axon fragmentation. 


phenotype, suggesting that SARM1 inhibi- 
tion may be an attractive therapeutic strat- 
egy for neurodegenerative diseases (72-74). 


SARM1 reveals an enzymatic TIR function 


SARM1 is a multidomain protein with an 
N-terminal regulatory region that contains 
multiple Armadillo repeats, a tandem sterile 
alpha motif (SAM) domain that mediates 
SARM1 multimerization, and a C-terminal
TIR domain (52, 62) (Fig. 1). Forced dimeriza-
tion of the SARM1 TIR domains leads to rapid
NAD⁺ depletion that is sufficient to trigger
axon degeneration (75). Because TIR domains 
function as molecular scaffolds to promote 
the formation of signaling complexes, it was 
hypothesized that axon injury promoted di- 
merization of SARM1 TIR domains that re- 
sulted in the interaction and activation of an 


Essuman et al., Science 377, eabo0001 (2022)


29 July 2022 



Fig. 3. Select pathways of NAD⁺ synthesis and TIR-mediated NAD⁺ degradation. (A) Nicotinamide
phosphoribosyltransferase (NAMPT), the rate-limiting enzyme in NAD⁺ biosynthesis, produces NMN from Nam
and 5-phosphoribosyl-1-pyrophosphate (PRPP) (not shown). NMNAT synthesizes NAD⁺ from NMN and ATP.
NAD⁺ can be cleaved by prokaryotic and eukaryotic TIR domains. The SARM1 TIR domain cleaves NAD⁺ into Nam,
ADPR, and cADPR. Bacterial TIRs can cleave NAD⁺ into Nam, ADPR, cADPR, and v-cADPRs. Plant TIRs can
also cleave NAD⁺ into Nam, ADPR, and v-cADPR. Two linkages of v-cADPR have been described: 2'cADPR
and 3'cADPR. (B) Plant TIRs generate small-molecule signals that activate the EDS1 complex. pRib-AMP
and pRib-ADP promote the EDS1-PAD4 interaction with ADR1, whereas ADPr-ATP and di-ADPR promote
EDS1/SAG101 interaction with NRG1. (C) 2'cADPR and TIR-derived EDS1 signals share a 1"-2' ribose-ribose
linkage. (D) Hypothetical cleavage sites in poly(ADP-ribose) that could produce plant-derived TIR molecules.
NAD⁺-consuming enzyme in the axon (22, 75).
However, biochemical experiments using pu-
rified proteins established that the SARM1
TIR domain itself possesses NAD⁺ hydrolase
activity (22). SARM1 hydrolyzes NAD⁺ through
a reaction that requires a glutamate residue
[Glu642 (E642) in human SARM1] in the TIR
catalytic pocket (22, 24). This NAD⁺ hydrolase
activity and conserved glutamate are present
in SARM1 homologs from multiple species, 
including humans, mice, flies, and worms 
(22, 24, 26). 

SARM1 TIR cleaves NAD⁺ at the nicotin-
amide ribosyl bond to produce nicotinamide 
(Nam), adenosine diphosphate ribose (ADPR), 
and cyclic ADPR (cADPR) (22, 24, 26) (Fig. 3). 
SARM1 also possesses base-exchange activity 
that can generate nicotinic acid adenine di- 
nucleotide phosphate (NAADP) (26, 76) as well 
as modifications of other compounds (77). 
SARM1 activation in axons leads to deple- 
tion of NAD⁺ and adenosine triphosphate
(ATP), followed by calcium influx, mitochon- 
drial depolarization, and axon degeneration 
(60, 75, 78, 79) (Fig. 2). The discovery of the 
SARM1 NAD⁺-consuming enzyme (NADase)
reemphasized the importance of NAD⁺ in
axon degeneration. Later studies showed that
the WldS functional moiety, NMNAT1, con-
fers axonal protection by inhibiting SARM1-
mediated NAD⁺ depletion (80).


Shared TIR enzymatic function across the 
tree of life 


The discovery that the SARM1 TIR domain is 
an NAD⁺ hydrolase suggested that TIR do-
mains in proteins that are present in different 
kingdoms might also possess enzymatic activ- 
ity. The SARM1 TIR domain is phylogeneti- 
cally and structurally more related to bacterial, 
archaeal, and plant TIR domains than to TIR 
domains implicated in animal innate im- 
munity (14, 24). Indeed, purified TIR domain
proteins of human TLR4 and other animal
TIR adaptors failed to cleave NAD⁺ in vitro
(22, 24, 81). This result reinforced phylogenetic 
analyses and suggested that SARM1 may have 
functions distinct from other mammalian TIR 
domains. Biochemical assays using bacterial, 
archaeal, and plant TIR domains demonstrated 
that many of them can function as NADases 
(15, 17, 18, 23-26). A conserved glutamate is 
present in plant, bacterial, and archaeal TIR 
domains and is required for their NAD⁺ hy-
drolase activity (15, 16, 23, 24). Other closely
related analogs of NAD⁺, such as nicotinamide
adenine dinucleotide phosphate (NADP), can
be cleaved by SARM1, plant, and bacterial
TIR proteins (15, 23, 24, 76). TLRs and their 
adaptors have not been shown to possess TIR 
enzymatic function thus far, but their ability 
to form complexes with other enzymes sug- 
gests that TIR-associated enzymatic activation 
may be a conserved mechanism across king- 
doms (6, 19, 42). Moreover, TLRs can activate
aerobic glycolysis and metabolic reprogram-
ming (5, 6, 82). This, coupled with NAD⁺ being
an indispensable and ubiquitous metabolic
factor (83), implicates TIRs as indirect medi-
ators of metabolism. Future studies should
clarify if mammalian TIRs that lack NAD⁺
cleavage activity might still bind NAD⁺ and
thus also influence signaling pathways in this 
manner. 


Bacterial and archaeal TIR domains 
are enzymes 


TIR domain NADases are present and active in 
diverse prokaryotes from a range of habitats 
(15-18, 25, 84). Prokaryotic TIRs also cleave 
NAD⁺ at the nicotinamide ribosyl bond (15, 25)
(Fig. 3) but can create multiple types of products. 
For example, TIR proteins from Staphylococcus 
aureus (TirS) and uropathogenic Escherichia 
coli (TepC) generate Nam and ADPR (15). By 
contrast, TIR proteins BtpA (from Brucella), 
AbTir (from Acinetobacter baumannii), TepO 
(from the archaeon Methanobrevibacter olleyae),
and other prokaryotic TIR domains generate
a variant of cADPR (v-cADPR) (15, 18, 84) (Fig.
3). v-cADPR was also detected in gnotobiotic
mice colonized with human gut bacterial strains
(17). Plant TIRs also produce v-cADPR in vitro
and in vivo (23). Finally, yet another variant 
of cADPR (v2-cADPR) is produced by the bac- 
terial TIR virulence factor HopAM1 and other 
bacterial TIR proteins (17, 84, 85). The cADPR 
isomers produced by plant TIRs and several 
bacterial TIRs have the same molecular weight 
as cADPR but are distinct, as shown by re- 
tention times in chromatography experiments 
(15, 18, 23, 85). The first chemical structures of 
plant and prokaryotic v-cADPR and v2-cADPR 
molecules indicate that they are circularized 
by means of ribose-ribose linkages (84, 86) 
(Fig. 3). v-cADPR is circularized by means 
of a 1"-2' glycosidic linkage between riboses 
(hereafter, 2’cADPR), whereas v2-cADPR is 
circularized by means of a 1”-3' glycosidic 
linkage (hereafter, 3’cADPR) (84, 86). It re- 
mains to be seen whether TIRs are capable of 
other linkages or whether there are isomeric 
differences between cADPR molecules that 
share the same linkage. 

Bacterial TIR domain proteins were first 
classified as virulence factors and thought 
to act by interacting with mammalian host 
TLRs to disrupt innate immune pathways 
(12, 13, 87-89). BtpA is a bacterial virulence 
effector protein from Brucella abortus that 
reduces total intracellular NAD⁺, an activ-
ity that requires the catalytic glutamate (89). 
Similar strategies are used by plant pathogens. 
The plant pathogen Pseudomonas syringae 
pv. (pathovar) tomato strain DC3000 encodes 
a TIR domain-containing virulence protein 
called HopAM1. HopAM1 virulence pheno- 
types are dependent on its conserved catalytic 
glutamate (84, 85). The mechanism by which
HopAM1’s enzymatic activity suppresses plant 
immunity remains unknown. One hypothesis 
is that the HopAM1 TIR competes for a plant 
TIR substrate that is required to promote 
immunity. An alternate hypothesis is that 
HopAM1-produced 3’cADPR, or related enzy- 
matic products, interfere with the host’s im- 
mune system. 


Bacterial TIR enzymes in antiphage 
defense responses 


Global surveys of bacterial genomes revealed 
that TIR domain-encoding genes are enriched 
in genomic regions that contain phage defense 
genes, suggesting that they may function in 
suppressing phage infections (16). One system, 
Thoeris, comprises an operon that is composed 
of ThsA and ThsB. ThsA usually encodes an 
NAD⁺ binding and cleavage domain at its N
terminus (18, 90). ThsB encodes a TIR-domain
protein (16). The Thoeris system is found in a
large number of bacterial and archaeal genomes
and restricts viral replication upon activation
(16). Mutations of the conserved catalytic
glutamate in the TIR domain of ThsB, or of
residues that affect NAD⁺ binding in ThsA,
abolish phage protection (16, 18).

Upon bacteriophage infection, ThsB TIR do- 
mains produce a cADPR isomer (presumably 
3'cADPR) (18, 84), which then acts as a signal-
ing molecule to activate ThsA (18). ThsA con-
sequently consumes bacterial NAD⁺, promoting
bacterial growth inhibition and/or cell death 
(Fig. 4). This process of metabolic arrest and 
cell death, termed abortive infection, is thought 
to curb the spread of the infecting phage at a 
population level (91, 92). Abortive infection is
reminiscent of the axon self-destruct mech- 
anism that is mediated by SARM1 activation 
in damaged axons and the hypersensitive 
cell death response in plants after successful 
pathogen recognition (7, 52). The discovery 
that bacterial TIR domain-containing pro- 
teins mediate antiphage defense systems is 
an impressive example of the continuity of 
TIR immune function, extending from phage 
defense in bacteria to pathogen responses in 
plants and animals. In addition to ThsB, other 
phage defense proteins are TIR-domain NAD⁺
hydrolases regulated by cyclic nucleotides 
(Fig. 4) (25, 93). Similarly, detection of foreign 
DNA by a prokaryotic Argonaute protein via 
guide RNAs activates TIR NADase activity in 
the SPARTA (short prokaryotic Argonaute and 
TIR-APAZ) immune system (94). 


TIR domains as enzymes in plant immune 
receptor function 


Plants deploy an intricate innate immune 
system that prevents disease (7, 9). Initial 
hypotheses imagined that similar to animal 
TIR proteins, plant immune receptors containing
TIR domains could function through

Fig. 4. Bacterial TIR NADases in antiphage, abortive infection defense programs. Shown are the Thoeris (left), Pycsar (middle), and CBASS (cyclic oligonucleotide-based antiphage signaling system) (right) defense systems. In the Thoeris system, bacteriophage infection activates the host ThsB TIR NADase to produce an isomer of cADPR, which then activates ThsA, leading to NAD⁺ depletion and subsequent metabolic crisis and growth arrest. In the Pycsar system, phage infection activates the PycC uridylate cyclase to produce cyclic uridine monophosphate (cUMP). cUMP activates the PycTIR NADase effector to promote abortive infection through NAD⁺ depletion. In CBASS, CD-NTase (cGAS/DncV-like nucleotidyltransferase) enzymes generate cyclic oligonucleotides. In the example presented here, c-di-GMP is generated by a member of the CD-NTase family. The c-di-GMP activates the NADase activity of the TIR-STING protein to cause bacterial growth arrest. Although the genomic location of CD-NTase and TIR-STING within “defense islands” suggests a role in bacterial defense, the phages that activate the TIR-STING system remain unknown.


assembly of signaling complexes. The obser- 
vation that the animal SARM1 TIR domain 
expressed an enzymatic function prompted 
the discovery of NAD⁺ hydrolase activity in
both plant TIR-only proteins and TNL proteins 
(23, 24, 95). Although both animal SARM1 and 
plant TIR domain proteins trigger cell death 
after activation, they achieve this through 
distinct mechanisms. SARM1 possesses high 
enzymatic activity and rapidly depletes the 
cell of NAD⁺, resulting in energetic catastrophe
and death (22, 23, 75). Plant TIRs are less-active 
enzymes in biochemical assays, and in vivo TIR 
activity does not result in severe NAD⁺ depletion
(23). This raises the question of how TIR
activity results in cell death if it does not in- 
duce energetic catastrophe. 

Enhanced disease susceptibility 1 (EDS1) 
is a conserved immune hub required for plant 
TIR-dependent cell death and disease resis- 
tance phenotypes across plant phylogeny (96). 
Downstream of EDS1 is a small set of ancient 
and conserved resistance to powdery mildew 8 
(RPW8)-like NLRs, called RNL “helper NLRs” 
that are also required for TIR-mediated cell 
death and immunity (97). RNL helper NLRs 
transduce TIR-EDS1-dependent signals into 
direct calcium channel formation and subsequent
signaling (98, 99). If plant TIRs functioned
through direct NAD⁺ depletion, one
might expect that they would not require 
downstream signaling components. Indeed, 
ectopic expression of SARM1 TIR triggers 
EDS1-independent cell death in plants (23). 
This is consistent with the hypothesis that 
strong NADases can kill cells through meta- 
bolic dysregulation. Most plant TIRs, by con- 
trast, are EDS1 dependent, even though they 
are enzymatically active in eds1 mutants, as
measured by the production of v-cADPR (23).
Thus, plant TIR NADase function is upstream
of EDS1 (Fig. 5). A class of TIR proteins is encoded
in some plant genomes that lack EDS1
(100). It is unclear how these EDS1-independent 
TIRs function or whether they are producing 
small-molecule signals. 

How, then, does TIR enzymatic activity ac- 
tivate downstream signaling? A pair of studies 
demonstrated a mechanistic link between TIR 
enzymatic products and EDS1 complex function
(101, 102). Huang et al. (101) identified a
small molecule after coexpressing an active TIR
enzyme along with the downstream compo-
nents EDS1 and PHYTOALEXIN DEFICIENT 4
(PAD4) in insect cells (101). EDS1 functions as
a heterodimer with one of two structurally
related proteins, either PAD4 or SENESCENCE-
ASSOCIATED GENE 101 (SAG101). EDS1, PAD4,
and SAG101 share a similar domain architecture,
with an N-terminal lipase-like domain and
a C-terminal domain called the “EP domain” 
for EDS1 and PAD4. SAG101 also contains an 
EP domain (Fig. 5). The EDS1-SAG101 crystal 
structure revealed a cavity formed in the het- 
erodimer, and EP domain residues that face 
this cavity are required for EDS1 complex 
function (103, 104). By purifying and crystallizing
the TIR-activated EDS1-PAD4 complex,
Huang et al. found electron density in the EP
domain pocket consistent with an ADPR-
related molecule that they named phosphoribosyl
adenosine diphosphate (pRib-ADP) (101).
TIR activity induces preferential EDS1-PAD4 
heterodimer formation followed by recruitment 
and activation of downstream ACTIVATED 
DISEASE RESISTANCE (ADR) RNLs in vivo 
(104-106). Using an in vitro assay, Huang et al. 
found that synthetic pRib-ADP and phosphoribosyl
adenosine monophosphate (pRib-AMP)
can replace the TIR activity that drives the
interaction of EDS1-PAD4 heterodimers with
downstream ADR RNLs (101). The putative
TIR catalytic glutamate was required both in
vitro to drive EDS1 complex oligomerization
and in vivo for the TIR-only protein RBA1
(response to the bacterial type III effector
protein HopBA1) to promote pRib-AMP accu-
mulation (38, 101). Thus, this study links TIR
enzymatic function mechanistically to EDS1 
and downstream events, which likely induce 
RNLs to form active resistosome-like calcium 
channels (98, 99) (Fig. 5). In an accompanying 
research article (102), plant TIRs were dem-
onstrated to generate ADP-ribosylated ADPR
(di-ADPR) and ADP-ribosylated ATP (ADPr-ATP)
(Fig. 3B). These molecules are structurally related
to pRib-AMP and pRib-ADP but preferentially
activate the EDS1-SAG101 heterodimer via
a binding site overlapping that of the pRib
molecules (102). In vitro, the RPS4 TIR do-
main was capable of producing all four EDS1
complex signals from NAD⁺ and ATP (102).
These results provide a compelling model that 
explains how PAD4 or SAG101 form specific 
EDS1 heterodimeric complexes that selec- 
tively recruit and signal through the two 
subclasses of RNL helper NLRs, ADRs, or 
NRGs, respectively. This results in distinct 
outputs, one biased toward cell death and 
the other toward a transcriptional immune 
output by means of as-yet-unknown mecha- 
nisms (107). 
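
The selective pairing described above can be summarized as a small lookup table. The sketch below is only an illustrative encoding of the relationships stated in the text (TIR-derived ligand, preferred EDS1 heterodimer, helper RNL subclass). The names `Eds1Module`, `EDS1_MODULES`, and `module_for_ligand` are hypothetical, and the table records preferential rather than exclusive activation; it is not a quantitative model.

```python
from typing import NamedTuple, Optional, Tuple


class Eds1Module(NamedTuple):
    """One EDS1 signaling module as described in the text (hypothetical type)."""
    heterodimer: str          # EDS1 partner complex
    helper_rnl: str           # RNL helper NLR subclass recruited
    ligands: Tuple[str, ...]  # TIR products that preferentially activate it


# Relationships taken from the review text; preferences, not strict specificities.
EDS1_MODULES = (
    Eds1Module("EDS1-PAD4", "ADR", ("pRib-AMP", "pRib-ADP")),
    Eds1Module("EDS1-SAG101", "NRG", ("di-ADPR", "ADPr-ATP")),
)


def module_for_ligand(ligand: str) -> Optional[Eds1Module]:
    """Return the module preferentially activated by a ligand, or None if unlisted."""
    for module in EDS1_MODULES:
        if ligand in module.ligands:
            return module
    return None
```

A ligand that the text does not assign to either complex, such as 2′cADPR, deliberately returns `None` here, which mirrors the open question about its role.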

The plant protein BdTIR activates ThsA in 
the Thoeris system (78). Thus, plant TIRs can 
produce a conserved signal (or signals) recog- 
nized by downstream signaling molecules 
across a considerable phylogenetic distance. 
Similar experiments were performed in the 
opposite direction by expressing bacterial TIRs 
in plants (108). The human inflammasome
perceives intracellular pathogen signals and 
oligomerizes into a wheel-like disc (109, 110). 
Using this chassis, Duxbury et al. generated a 
hybrid inflammasome that fused TIR domains 
to the oligomerization domains of an animal 
inflammasome (108). The hybrid receptor 
with a plant TIR domain activated cell death 
in planta in a ligand-specific manner. This 
death was EDS1 dependent, consistent with 
activation of the endogenous TIR cell death 
pathway. Similar fusions with bacterial AbTir, 
however, failed to trigger cell death but did 
produce measurable v-cADPR (2'cADPR) (108). 
Duxbury et al. concluded that TIR NADase 
activity and v-cADPR production appear nec- 
essary, but not sufficient, for plant cell death (but 
see section “Plant TIR domains also cleave DNA 
and RNA” for 2’,3'-cNMP synthetase activity). 

One outstanding question is the relevance 
of plant TIR-produced 2’cADPR (Fig. 3). Is this 
molecule an intermediate in the production of 
the ligands that drive specific EDS1 complex 
formation, does it have distinct functions, or is 
it immunologically irrelevant enzymatic noise? 
Could it have a function related to the 3’cADPR 
produced in the plant cell by phytopathogens 
via HopAM1? The 2′cADPR molecule was originally
described as a “biomarker” for TIR

Fig. 5. Intracellular plant immune receptor pathways. (A) Plant immune receptors are shown in green. Both CC-NLRs and TIR-NLRs have been shown to bind pathogen triggers (pink circles) and oligomerize into wheel-like resistosomes. For CC-NLRs, the pentameric resistosome generates an ion channel that allows an influx of Ca²⁺ and potentially other ions into the cytoplasm. TIR-NLRs form a tetrameric oligomer that activates the NADase activity of the TIR domain. TIR-only proteins can be activated by pathogen triggers to form a presumed NADase tetramer, which produces small-molecule signals (e.g., pRib-ADP) or a helical assembly that expresses both nuclease and 2′,3′-cNMP synthetase activity. TNL and TIR-only pathways are dependent on the downstream EDS1 complex and CC_RPW8-NLRs (RNL “helper NLRs”). (B) The EDS1 complex binds TIR enzymatic products, and this activated complex interacts selectively with ADR or NRG helper RNLs to activate immunity (see inset). SM, small molecules. (C) A hypothetical model for a two-tiered TIR immune system. The initial TIR-NLR response is triggered when a TIR-NLR or TIR-only protein recognizes the presence of a pathogen-dependent trigger. This results in production of enzymatic products and activation of the EDS1 complex and RNLs, as shown in (A) and (B). TIR amplification occurs when the initial pathogen response results in the transcription of a large number of “booster TIRs.” Booster TIRs amplify the initial response and result in the production of large amounts of 2′,3′-cNMP, which then goes on to activate further immune response in both infected and neighboring uninfected cells. TF, transcription factor.


enzymatic function in planta because its pres- 
ence was correlated with TIR immune function 
(23). It is notable that hydrolysis of the pyro- 
phosphate linkage of 2’cADPR would yield 
pRib-AMP (Fig. 3C). pRib-ADP has an ad- 
ditional phosphate that is not present in 
2′cADPR. Generation of pRib-ADP from 2′cADPR
would require either phosphorolysis of the
pyrophosphate bond in 2′cADPR (111) or phos-
phorylation of pRib-AMP. 2′cADPR’s charac-
teristic 1″-2′ ribose-ribose linkage is also present
in (poly)ADP-ribose, a molecule that is induced
during plant immune responses (112). Cleavage
of linear (poly)ADP-ribose could directly pro-
duce three of the EDS1-complex signals: pRib-AMP,
pRib-ADP, and ADP-ribosylated ADPR
(Fig. 3D). It will be interesting to determine
whether TIRs can also use polyADP-ribose
as a substrate, potentially releasing di-ADPR
through hydrolysis of the 1″-2′ linkage.


Structural basis of TIR-domain 
enzymatic function 


The crystal structures of many animal, plant, 
and bacterial TIR domains have been determined
(43, 113-117). A conserved core fold is
present in TIR domains that consists of a conserved
central five-stranded β sheet (βA to βE),
typically with five surrounding helical regions
(αA to αE) separated by loops (0, 116). These
TIR-domain structures share similarities with
the Rossmann fold, an ancient motif with alter-
nating β strands and α helices that can bind
nucleotides such as NAD⁺ (90, 118, 119). As
noted above, mutation of the conserved glu- 
tamate residue in TIR domains abolishes en- 
zymatic activity and function in animal, plant, 
and bacterial systems. Additionally, a con- 
served aromatic amino acid was identified 
that is required for generating a cyclized ADPR 
product (cADPR, 2'cADPR, or 3’cADPR) (84). 
Thus, a shared TIR catalytic mechanism has 
been conserved throughout evolution. 

TIR activation is often driven by oligomerization.
Two groups reported cryo-electron
microscopy (cryo-EM) structures for ligand-bound,
activated plant TNL receptors (120, 121).
The oligomerization state of activated TNL 
immune receptors bound to their ligands is a 
tetramer (Fig. 6). The pathogen effector ligand 
binds primarily to the LRRs and a downstream 
C-terminal domain and causes the NBS-LRR 
domains to oligomerize into a ring-like struc- 
ture. This structure brings four TIR domains 
into proximity and is reminiscent of the 
pentameric ZAR1 CNL resistosome (122) and
several animal inflammasomes. The interac- 
tions between these TIR domains in their full- 
length protein context leverage interfaces 
predicted from truncated TIR-domain crystal 
structures to produce a tetramer composed of 
two pairs of TIRs (120, 121). Within each TIR
pair, one of the TIR domains undergoes a 
conformational change in which the “BB loop” 
is rearranged to expose the putative catalytic 
glutamate residue. This transformation upon 
tetramerization would allow cleavage of NAD⁺
and promote production of TIR-derived signal 
molecules. Cryo-EM studies of SARM1 in its 
active conformation revealed similar findings, 
with two SARM1 TIR molecules forming a 
substrate binding site (77). The BB loop is 
present in this proposed active site for NAD⁺
cleavage (Fig. 6), and mutations in the BB loop 


Fig. 6. TIR protein oligomerization strategies. (A) Activated plant TIR-NLR immune receptors and animal SARM1 function as oligomeric proteins. TIR domains are shown in green, the oligomeric chassis is in teal, and, for the plant complex, the bacterial ligand is in magenta. The oligomerized plant TIR complex is tetrameric [RPP1, Protein Data Bank (PDB) ID 7CRC, is shown; also see ROQ1 (PDB ID 7JLX)], and the animal SARM1 is octameric (PDB IDs 7NAK and 7NAL). (B) Convergent oligomerization strategies shown in (A) result in similar TIR interfaces for plant TIRs (PDB ID 7CRC is shown) and SARM1. Putative catalytic glutamates are shown in magenta. The BB loop is shown in purple, and the interaction interface mediated by the αA and αE helices (AE) is shown in orange. Graphics in (A) and (B) are adapted from Shi et al. (77).


inhibit NADase activity and axon degeneration
(22, 24, 123). Similar to SARM1, a substrate
binding site in bacterial AbTir is formed
by two TIR molecules. Mutations in the BB 
loop also impair enzymatic function (77, 84). 

The autoinhibitory regulation of plant NLRs 
and animal SARM1 share some common fea- 
tures. SARM1 exists as an octamer in a ring 
shape, with the tandem SAM domains form- 
ing the central core of the ring, the N-terminal 
armadillo repeat (ARM) domains wrapping 
around the ring, and the TIR domains lodged 
between the ARM domains on the outside of 
the ring (77, 114, 115, 124-127). The N-terminal
autoinhibitory ARM domain in SARM1 inter- 
acts with the TIR domain (77, 114, 115, 123-126). 
This is analogous to the NBS and LRR domains 
in plant NLRs (7). After activation, the eight 
SARM1 TIR domains move above the plane 
of the ARM-SAM ring and reorganize in a 
two-by-four grid (77). The organization of the 
SARM1 TIRs is notably similar to that of the 
plant tetrameric TIR, using the same inter- 
faces and overall geometry (77). 

During activation of SARM1, both NAD⁺
and NMN compete for an allosteric binding
site within the autoinhibitory N-terminal do-
main of SARM1, with the ratio of bound NMN
to NAD⁺ dictating the activation state of the
enzyme (114, 115, 125, 126, 128). Higher NMN-
to-NAD⁺ ratios result in an activated enzyme
that consumes NAD⁺ and promotes axon death,
whereas low ratios keep the enzyme in the off
state (114). Mutations in the NMN-NAD⁺ bind-
ing pocket that disrupt NMN binding block
both activation of NAD⁺ hydrolase activity and
injury-induced axon degeneration (114). Con-
versely, hypermorphic mutations in the NMN
binding pocket produce an enhanced or con-
stitutively active form of the enzyme (114).
These structural studies of SARM1 clarify the 
activation mechanism of the TIR enzymatic 
activity and provide a blueprint for how other 
proteins with TIR-domain NAD⁺ hydrolases
may be regulated. 
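
The ratio dependence described above can be illustrated with a textbook competitive-binding calculation. This is a minimal sketch, assuming a single allosteric site with simple equilibrium binding of two competing ligands. The function name `nmn_occupancy`, the dissociation constants, and the concentrations are hypothetical placeholders rather than measured values from the cited studies, and real SARM1 activation is likely more complex than site occupancy alone.

```python
def nmn_occupancy(nmn: float, nad: float, kd_nmn: float, kd_nad: float) -> float:
    """Fraction of allosteric sites occupied by NMN when NMN and NAD+ compete.

    Standard competitive equilibrium binding at a single site:
        occupancy = (NMN/Kd_NMN) / (1 + NMN/Kd_NMN + NAD/Kd_NAD)
    All concentrations and Kd values share the same (arbitrary) units.
    """
    nmn_term = nmn / kd_nmn
    nad_term = nad / kd_nad
    return nmn_term / (1.0 + nmn_term + nad_term)


# Hypothetical illustration values (arbitrary units), not experimental data.
low_ratio = nmn_occupancy(nmn=1.0, nad=100.0, kd_nmn=1.0, kd_nad=10.0)   # NMN:NAD+ = 0.01
high_ratio = nmn_occupancy(nmn=10.0, nad=10.0, kd_nmn=1.0, kd_nad=10.0)  # NMN:NAD+ = 1
```

With these assumed numbers, NMN occupancy rises from 1/12 to 10/12 as the NMN-to-NAD⁺ ratio increases, which is the qualitative behavior the text attributes to the allosteric site.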

Although plant TNLs and animal SARM1 
have convergent mechanisms for TIR activa- 
tion, not all TIRs follow their example as close- 
ly. The bacterial TIR-STING (TIR-stimulator 
of interferon genes) sensor is present in both 
prokaryotes and basal eukaryotes (25). TIR 
domains of TIR-STING proteins use a proline- 
rich loop to mediate TIR-TIR interactions but 
rely on the same conserved catalytic gluta- 
mate for NADase activity (Fig. 6) (25). The 
binding of cyclic-di-guanine monophosphate 
(c-di-GMP) to the STING domain of TIR-STING 
proteins drives oligomerization into filamentous

(C) Schematic cartoon of a plant TIR-nucleic acid helical oligomer. In contrast to the tetrameric TIR-NLR structure, helical TIRs are interconnected with repeating AE (orange) and DE (red) interfaces. The DE interaction interface is mediated by the αD and αE helices. The BB loop interacts with the minor groove of DNA. The cartoon is adapted from Yu et al. (27). (D) Both metazoans and prokaryotes have TIR NADases that are regulated by C-terminal STING domains. Oligomerization of TIR-STING proteins is controlled by cyclic-dinucleotide (sticks) binding to the STING domain (teal). The two monomers cross perpendicular to each other, indicated by dark and light shading. Green TIR domains associate through a distinctive proline-rich loop interaction surface (black dashed line). Putative catalytic glutamate is shown in orange. The prokaryotic TIR domain was not crystallized but is potentially similar to the metazoan structure. The metazoan TIR-STING structure is PDB ID 6WT7, and the prokaryotic STING structure is PDB ID 6WT4. The graphic in (D) is adapted from Morehouse et al. (25).

structures, and mutations that disrupt either
c-di-GMP binding or oligomerization into filaments
disrupt NAD⁺ cleavage (25). The formation
of filamentous structures was also
observed upon binding of small molecules 
to additional bacterial TIR NADases (84, 129). 
The function of these higher-order structures 
remains to be elucidated but is reminiscent of 
prion-like protein activation seen in other im- 
mune defense proteins (130, 131). Thus, there 
are diverse, independently evolved strategies to 
oligomerize and/or activate TIR NADase activity. 


Plant TIR domains also cleave DNA and RNA 


Plant TIR enzymatic activity that produces 
small-molecule signals to activate either of 
the EDS1 RNL complexes is essential for func- 
tion. A second enzymatic function was de- 
scribed for plant TIRs (27). Yu et al. found that 
both TIR-only proteins and TIR domains 
derived from TNL proteins could degrade 
nucleotides, DNA, and RNA in vitro. Plant TIR 
domains produce cyclic nucleotide monophosphates
(cNMPs) such as 2′,3′-cAMP and 2′,3′-cGMP,
molecules that are stress signals in
plants and animals (132, 133). 

Whereas the first plant TNL cryo-EM struc- 
tures revealed a tetrameric resistosome, new 
cryo-EM structures show that a plant TIR 
domain (lacking NBS-LRR domains) is capable 
of oligomerizing into helical structures (27). 
The purified L7 TIR domain is a C-terminal 
truncation that lacks the NBS-LRR domains 
that are found in the full-length L7 TNL pro- 
tein. The L7 TIR domain structure revealed 
two protofilaments that twist into a superhelix 
that also appears to contain nucleic acid (Fig. 6). 
This structure supports the hypothesis that 
plant TIRs can also act as nucleases and defines 
basic arginine and lysine residues that likely 
contact the nucleic acid. Indeed, the muta- 
tion of these residues completely blocks 2',3’- 
cNMP synthetase activity and TIR-triggered 
cell death but only partially impairs NADase 
function. Thus, NADase and synthetase activ- 
ity are separable, and partial NADase activity 
alone is insufficient for immune function as 
assayed by cell death. Although the structural 
details of catalysis are unclear, TIR nuclease 
activity and 2′,3′-cNMP synthetase activity both share
the requirement for the putative catalytic 
glutamic acid with NADase activity, indicating 
functional overlap. 

In planta experiments that genetically re- 
duce levels of 2’,3'-cNMP also blocked TIR cell 
death, consistent with an immune signaling 
function for these cyclic nucleotide products. 
Is 2',3'-cNMP another signal that activates 
EDS1? Or does this product have other signaling
functions? Accumulation of 2′,3′-cNMP
in planta requires the EDS1 complex, which
would place its generation downstream of
EDS1, at odds with data showing that TIR
enzymes act upstream of EDS1 (27). To explain
this, Yu et al. suggest a feedback loop in which
the putative 2′,3′-cNMP EDS1-activating signal
must be amplified in an EDS1-dependent
manner (Fig. 5) (27). Yu et al. also found that 
bacterial AbTir does not have 2',3'-cNMP syn- 
thetase activity. The lack of AbTir synthetase 
activity could potentially explain why this 
2'cADPR-producing TIR did not activate the 
EDS1-dependent plant cell death pathway as 
a chimeric fusion to the NLRC4 chassis, as 
described above (27, 108). The authors con- 
clude that plant TIRs can exist in two dis- 
tinct oligomeric complexes: as a tetramer with 
NADase activity and as a helical synthetase. 
Many details of this dual-specificity enzymatic 
system remain to be clarified. Are full-length 
TIR-NBS-LRR proteins capable of forming the 
helical synthetase given potential steric issues? 
Do alternatively spliced TIRs related to the 
truncated TIR-only isoform generate 2'3'- 
cNMP? Is there selective catalysis to produce 
both the EDS1-complex signals and 2',3’-cNMP 
products? And what are the relevant in vivo 
substrates? Immune activation leads to tran- 
scriptional activation of many TIR domain- 
encoding genes, and overexpression of these 
genes can cause cell death (39, 134-136). Thus, 
one model is that a primary TNL or TIR-only 
receptor is activated by a pathogen-encoded 
ligand to promote oligomerization that favors 
NAD⁺ hydrolase activity; the consequent EDS1
complex-dependent transcriptional output 
generally raises TIR domain levels through 
new transcription (Fig. 5C). These “booster 
TIRs,” in turn, drive helical oligomerization 
that favors 2'3'-cNMP synthetase activity to 
stimulate defense responses, perhaps in dif- 
ferent subcellular compartments and/or in 
cells directly neighboring the infection site 
(Fig. 5C). 


Conclusion 


The study of TIR domain proteins as an en- 
zyme family has transformed our understand- 
ing of this evolutionarily ancient signaling 
domain. The discovery of the SARM1 NADase 
has directed therapeutic approaches in neuro- 
degeneration (137-140). Constitutively hyper- 
active SARM1 NADase variants were found in 
ALS patients (141, 142), suggesting that SARM1
alleles may contribute to disease risk in ALS. 
Hence, inhibiting SARM1 could alleviate cer- 
tain neurodegenerative diseases. The role of 
SARM1 and NAD⁺ metabolism during development
and aging is also emerging (143-147).
Future studies will help strengthen this link, 
with implications for neurodevelopmental 
genetic disorders. 

Do TLRs and other animal TIR domains 
besides SARM1 truly lack enzymatic function? 
A few published reports suggest this (22, 24, 81). 
Yet some of these proteins possess the con- 
served glutamic acid residue necessary for 
NAD⁺ cleavage in SARM1, plant TIRs, and
bacterial TIRs. Structural studies indicate
these glutamate residues are in suboptimal 
positions to perform the necessary catalytic 
functions (24, 81). It is possible that mamma- 
lian TIR domains retain enzymatic activities 
but that their cognate substrates have not yet 
been identified. Prior biochemical experi- 
ments with purified proteins may have failed 
to recapitulate the true biologically active con- 
formation of these mammalian proteins and 
hence missed their enzymatic activity. 

The discovery of new cADPR isomers gen- 
erated by plant and bacterial TIR proteins 
(Fig. 3) is another advance emerging from 
the work on TIR enzymatic functions. We now 
know that there are at least two distinct 
linkages: 2′cADPR and 3′cADPR. These small
molecules can bind to and activate other effec- 
tor enzymes in bacterial antiphage defense 
programs. Moreover, this ability to activate 
effector proteins to trigger bacterial suicide 
can be leveraged to develop a class of anti- 
biotics that will target bacterial TIR proteins 
during infections. The structural similarities 
of the cADPR isomers also raise the possibility 
that these molecules, whether in their native 
biological context or whether repurposed as 
drugs, can function as competitive inhibitors 
of immune pathway proteins to subvert host 
immune responses. It also remains to be deter- 
mined if these cADPR isomers possess calcium- 
mobilizing effects. 

Understanding the role of TIR-derived small- 
molecule products in the plant immune system 
should clarify gaps in our understanding of 
plant immunity. Discovery of 2'-linked ribose- 
ribose small molecules as the connection 
between TIR enzymatic activity and EDS1 
complex association with downstream RNLs 
is a landmark discovery and may provide new 
natural chemicals to control disease resistance. 
Yu et al. (27) propose that the protein structures 
produced by TIR-only and TNL proteins have 
distinct enzymatic activities. How and where 
these protein structures and activities are 
occurring in vivo remains an open question for 
which we have proposed a speculative model 
(Fig. 5C). Similarly, it is unknown whether full- 
length TNL proteins (and alternatively spliced 
products) form both the tetramer NADase 
and the helical 2',3'-cNMP synthetase. Equally 
curious is the potential for separation of func- 
tions encoded by full-length TNLs and TIR 
domain-only proteins like RBA1. 

The revelation that TIR proteins are per- 
forming similar functions in cell death and 
disease resistance across kingdoms has had 
an energizing effect across many fields. Dis- 
coveries in animals, plants, and prokaryotes 
are generating hypotheses and driving research 
and applications across kingdoms as we learn 
more about the common and distinctive fea- 
tures of members of this broad and ancient 
protein family.

INTRODUCTION: Nucleotide-binding leucine-rich
repeat (NLR) proteins are intracellular
immune receptors with roles in both plant 
and animal protection against diseases. Plant 
NLRs can be divided into two broad sub- 
classes: CNLs with an N-terminal coiled-coil 
domain and TNLs with an N-terminal Toll/ 
interleukin-1 receptor (TIR) domain. Plant 
NLRs confer specific recognition of pathogen 
effectors to initiate effector-triggered immu- 
nity. NLR recognition of pathogen effectors 
induces NLR oligomerization, leading to large 
complexes called resistosomes that mediate 
effector-triggered immunity signaling. The TNL 
resistosomes, Arabidopsis RPP1 and Nicotiana 
benthamiana ROQ1, act as nicotinamide adenine
dinucleotide hydrolases (NADases) encoded
in the N-terminal TIR domain. TNL-dependent
immune signaling requires the conserved
lipase-like proteins Enhanced Disease Susceptibility
1 (EDS1) and its two direct and exclusive
partners Phytoalexin Deficient 4 (PAD4) and
Senescence-Associated Gene 101 (SAG101), along
with the helper NLR subfamilies Activated Disease
Resistance 1 (ADR1) and N requirement
gene 1 (NRG1) of the CNL class. Multiple lines
of evidence support the functional cooperation
of EDS1-PAD4 dimers with ADR1 and EDS1-SAG101
dimers with NRG1 in conferring immunity.
An open question is how EDS1 dimers
and helper NLRs integrate signals from the 
TNL resistosomes to initiate plant defense. 


RATIONALE: TIR domain NADase-catalyzed 
products are hypothesized to be perceived 
by EDS1-PAD4 and EDS1-SAG101 dimers to 
activate, respectively, ADR1 and NRG1. However,
the molecular identities and roles of the
presumed TIR-generated signals remain elu- 
sive. Plant TIR domain proteins are known to 
catalyze the production of multiple nucleotide- 
based compounds, making it challenging to 
identify the precise products recognized by 
EDS1 heterodimers. A TNL-triggered EDS1/
SAG101/NRG1 signaling module was success-
fully reconstituted in a heterologous Nb tobacco
system. This suggests that TIR-catalyzed small
molecules might also be synthesized in a het-
erologous eukaryotic cell line, such as insect
cells. We hypothesized that EDS1-PAD4 and
EDS1-SAG101 are receptors for specific TIR-
catalyzed products. Thus, if a TNL resistosome
is coexpressed with EDS1 and PAD4 or EDS1
and SAG101 in insect cells, then the TNL-
generated small molecules will likely be captured
by the EDS1 dimers and identified.


TIR-catalyzed signaling molecules activate plant immunity. Pathogen effector activation induces assembly of a 
TNL resistosome with TIR catalytic activity producing pRib-AMP and pRib-ADP. These two small molecules bind 
specifically to a pocket between EDS1 and PAD4, triggering a rotation of ~10° in the PAD4 C-terminal domain. This 
creates a fully active configuration of the ligand-binding pocket between EDS1 and PAD4 and allosterically induces 
EDS1-PAD4 association with a CNL-type helper NLR, ADR1, thereby potentiating an ADR1-mediated immune response. 




RESULTS: Coexpression of the RPP1 resis- 
tosome (containing RPP1 with its cognate 
Hyaloperonospora arabidopsidis effector 
ATR1), EDS1 and PAD4 (collectively called
RAEP), and ADR1-like 1 (ADR1-L1) resulted in 
EDS1-PAD4 interaction with ADR1-L1 in insect 
cells. A similar activity was also demonstrated 
for the monocot TIR from Brachypodium 
distachyon. The induced EDS1-PAD4 interaction
with ADR1-L1 was abrogated by an
NADase catalytic mutation (E158A) of RPP1.
Compounds extracted from EDS1-PAD4 pro- 
tein purified from the RAEP system stimulated 
apo-EDS1-PAD4 binding to ADR1-L1. A crystal 
structure of the EDS1-PAD4 complex revealed 
that a small molecule, 2′-(5″-phosphoribosyl)-
5′-adenosine diphosphate (pRib-ADP), binds
specifically to a conserved pocket between 
EDS1 and PAD4. Liquid chromatography-high 
resolution mass spectrometry validated the 
identity of pRib-ADP and identified 2′-(5″-
phosphoribosyl)-5′-adenosine monophosphate
(pRib-AMP) in the EDS1-PAD4 complex.
As further confirmation of the two compounds, 
synthetic pRib-AMP and pRib-ADP exhibited 
potent activity in inducing the EDS1-PAD4 
interaction with ADR1-L1, but were much less 
efficient in promoting the EDS1-SAG101 interaction
with NRG1A. Expression of the TIR-only
protein Arabidopsis RBA1 in Nb tobacco
promoted the accumulation of pRib-AMP. The 
pRib-AMP/ADP-binding pocket overlaps with 
a previously identified surface groove of EDS1- 
PAD4 that is required for pathogen resistance. 
Structural comparison between pRib-ADP-bound
and apo-EDS1-PAD4 complexes revealed
conformational changes in the PAD4
C-terminal domain, which in turn result in
a fully active configuration of the pRib-ADP-binding
site between the EP domains of PAD4
and EDS1.


CONCLUSION: The study identifies pRib-AMP 
and pRib-ADP as TIR-catalyzed bioactive pro- 
ducts in diverse seed plants and establishes 
EDS1-PAD4 as a preferential receptor complex 
for these two small molecules. pRib-AMP and 
pRib-ADP binding involves an induced-fit mechanism
and allosterically induces an EDS1-PAD4
association with ADR1 to promote immunity
signaling. Our data provide a missing link 
between pathogen perception and defense. 
They further suggest that pRib-AMP and pRib- 
ADP are a class of second messengers regulat- 
ing ADR1-mediated plant immunity. 
Plant nucleotide-binding leucine-rich repeat-containing (NLR) receptors with an N-terminal Toll/
interleukin-1 receptor (TIR) domain sense pathogen effectors to enable TIR-encoded nicotinamide 
adenine dinucleotide hydrolase (NADase) activity for immune signaling. TIR-NLR signaling requires the 
helper NLRs N requirement gene 1 (NRG1), Activated Disease Resistance 1 (ADR1), and Enhanced Disease 
Susceptibility 1 (EDS1), which forms a heterodimer with each of its paralogs Phytoalexin Deficient 4 
(PAD4) and Senescence-Associated Gene 101 (SAG101). Here, we show that TIR-containing proteins 
catalyze the production of 2'-(5"-phosphoribosyl)-5'-adenosine monophosphate (pRib-AMP) and 
diphosphate (pRib-ADP) in vitro and in planta. Biochemical and structural data demonstrate that EDS1- 
PAD4 is a receptor complex for pRib-AMP and pRib-ADP, which allosterically promote EDS1-PAD4 
interaction with ADR1-L1 but not NRG1A. Our study identifies TIR-catalyzed pRib-AMP and pRib-ADP as a 
missing link in TIR signaling through EDS1-PAD4 and as likely second messengers for plant immunity. 


Nucleotide-binding leucine-rich repeat-containing
(NLR) proteins are intracellular
immune receptors with crucial
roles in innate immunity in plants and
animals (1). Plant NLRs have a conserved
organization with a central nucleotide-binding 
domain (NBD), a C-terminal leucine-rich repeat 
(LRR) domain, and an N-terminal coiled-coil 
(CC) or Toll/interleukin-1 receptor (TIR) do- 
main, referred to as CNL or TNL receptors, 
respectively. Monocot plants lack TNLs but 
encode TIR-only and TIR-NB proteins (2, 3). 
Plant NLRs mediate specific recognition of 
pathogen effectors to initiate effector-triggered 
immunity (ETI). Activated ETI signaling results 
in pathogen restriction and often a host hyper- 
sensitive cell death response at infection sites 
(4). Direct or indirect recognition of effectors 
induces NLR oligomerization (5), resulting 
in the formation of large protein complexes 
called resistosomes (1, 6, 7). The Arabidopsis
CNL ZAR1 resistosome functions as a calcium-permeable
channel for ETI signaling (8). By
contrast, the TNL resistosomes Arabidopsis
RPP1 (9) and Nicotiana benthamiana (Nb
tobacco) Roq1 (10) are nicotinamide adenine
dinucleotide hydrolase (NADase) enzymes encoded
by their N-terminal TIR domain (11, 12).
TIR NADase activity relies on the presence of a 
conserved glutamate residue that is also es- 
sential for TIR-triggered immune signaling 
(11, 12). 

Plant TIR-dependent immune signaling 
requires downstream conserved helper NLRs 
(hNLRs) of the CNL class and the lipase-like 
protein family members Enhanced Disease 
Susceptibility 1 (EDS1), Phytoalexin Deficient 4 
(PAD4), and Senescence-Associated Gene 101 
(SAG101) (13, 14). The hNLRs divide into the 
ADR1 [ADR1, ADR1-like 1 (ADR1-L1), and
ADR1-L2] and NRG1 (NRG1A, NRG1B, and
NRG1C) subfamilies in Arabidopsis. EDS1 exclusive
heterodimers with PAD4 and SAG101
confer basal immunity and ETI responses 
(15-17). The C-terminal EP domain (shared by 
EDS1, PAD4, and SAG101) contains positively 
charged residues required for TNL signaling 
(6, 18-20). Phylogenomic (16, 21) and genetic
(6, 17, 19, 22-25) studies suggest a functional
partnership of EDS1-PAD4 with ADR1 and
EDS1-SAG101 with NRG1. This is supported
by induced associations of Arabidopsis EDS1-SAG101
and EDS1-PAD4 with NRG1 and ADR1,
respectively (19, 26) in planta, which are required
for pathogen resistance and potentially
Ca²⁺-channel activity of hNLRs (19, 27). In the
current model, products generated by the TIR- 
encoded NADase activity are intercepted by 
EDS1 heterodimers for activation of hNLRs 
(19, 26). Variant-cyclic ADP-ribose (v-cADPR) 
is a TIR-generated product (11, 12, 28), although 
its molecular identity and role in the plant
immune system remain elusive. Thus, TIR- 
catalyzed chemical products mediating im- 
mune signaling are unknown. 

Here, we identified plant TIR-catalyzed
2'-(5"-phosphoribosyl)-5'-adenosine monophosphate
(pRib-AMP) and diphosphate (pRib-ADP)
through mass spectrometry (MS) and structural 
biology. These two compounds are produced 
by expression of the RPP1 resistosome or TIR 
domain of Arabidopsis RPS4 in insect cells. 
They bind to and promote EDS1-PAD4 direct 
interaction with ADR1-L1. The same activities 
are reproduced with chemically synthesized 
pRib-AMP and pRib-ADP. These molecules 
are much less efficient in promoting the EDS1-SAG101
interaction with NRG1A, indicating
preferential recognition of pRib-AMP and 
pRib-ADP by EDS1-PAD4. Structural com- 
parisons reveal that pRib-ADP binding re- 
sults in conformational changes in the EP 
domain of PAD4, enabling interaction with 
ADR1-L1. The data establish two nucleotide
derivatives, pRib-AMP and pRib-ADP, as TIR-produced
signals activating the EDS1/PAD4/ADR1
immunity module in plants. Identification
of pRib-AMP and pRib-ADP as potential
immune second messengers opens the way 
for designing small molecules to manipulate 
plant immunity. 


Reconstitution of TIR-induced EDS1-PAD4 
interaction with ADR1 in insect cells 


A TNL-triggered Arabidopsis EDS1/SAG101/NRG1
immunity signaling module was reconstituted
in Nb tobacco (16, 19). To facilitate
identification of TIR domain catalytic products,
we tested whether NRG1 or ADR1 activation
could be reconstituted in heterologous cells.
We coexpressed Arabidopsis RPP1, its activating
pathogen effector ATR1, EDS1, and PAD4
(collectively called RAEP), and ADR1-L1 in
insect cells and assayed for induced EDS1-PAD4
interaction with ADR1-L1 using a pull-down
assay (16, 26). As shown in Fig. 1A (top)
and fig. S1A, EDS1-PAD4 but not EDS1-SAG101
copurified with ADR1-L1 in the assay, supporting
specific and direct interaction of EDS1-PAD4
with ADR1-L1. The interaction was dependent
on the presence of the RPP1 resistosome (fig.
S2). EDS1-SAG101 but not EDS1-PAD4 interacted
with NRG1A when coexpressed with
RPP1 and ATR1 (called RAES) (Fig. 1A, bottom,
and fig. S1A). These interactions were abol- 
ished when using an NADase catalytic mutant 
(E158A) of RPP1 (9) (Fig. 1A), suggesting that 
RPP1 TIR catalytic products promote EDS1 
heterodimer associations with specific hNLR 
subtypes. The isolated TIR domain (residues 
1 to 236) of Arabidopsis TNL RPS4 triggers 
EDS1-dependent HR cell death when expressed 
in Nicotiana tabacum plants (29). Insect cell 
coexpression of RPS4 TIR also promoted EDS1- 
PAD4 and EDS1-SAG101 binding to ADR1-L1 
Fig. 1. TIR-catalyzed products induce the EDS1 heterodimer interaction
with hNLRs in insect cells. (A) RPP1-ATR1 induces an EDS1-PAD4 and
EDS1-SAG101 interaction with ADR1-L1 and NRG1A, respectively, in insect cells.
N-terminally GST-tagged ADR1-L1 or NRG1A was coexpressed with nontagged EDS1,
C-terminally strep-tagged PAD4 or SAG101, C-terminally strep-tagged RPP1
(WT or catalytic mutant E158A), and 10xHis-tagged ATR1 in insect cells. GST-ADR1-L1
or NRG1A was purified using glutathione sepharose 4B (GS4B) beads. GS4B-bound
proteins were eluted, separated by SDS-polyacrylamide gel electrophoresis
(SDS-PAGE), and detected by Coomassie brilliant blue staining. (B) RPS4 TIR
(residues 1 to 236) induces an EDS1-PAD4 and EDS1-SAG101 interaction with
ADR1-L1 and NRG1A, respectively, in insect cells. Experiments were performed as
described in (A) except that RPP1 and ATR1 were replaced with RPS4 TIR. (C) EDS1-PAD4
or EDS1-SAG101 was expressed as described in (A) and purified with
Strep-Tactin resin. The purified protein was concentrated and denatured by heating.
After centrifugation, the supernatant was collected and incubated with separately
purified N-terminally GST-tagged ADR1-L1 or NRG1A and apo-EDS1-PAD4 or
apo-EDS1-SAG101. The mixture was passed over GS4B beads, and bound proteins were
eluted, analyzed by SDS-PAGE, and detected by Coomassie brilliant blue staining.


and NRG1A, respectively, which was abolished
by mutation of the RPS4 TIR catalytic
residue Glu88 (Fig. 1B and fig. S1B). These
results show that TIR domain catalytic prod- 
ucts induce EDS1-PAD4 and EDS1-SAG101 
interactions with their cofunctioning hNLRs. 
A TIR-only protein from the monocot species 
Brachypodium distachyon (BdTIR) also elicits 
EDS1-dependent cell death when expressed in 
Nb tobacco (12). Coexpression with BdTIR in 
insect cells similarly induced EDS1-PAD4 
association with ADR1-L1 (fig. S3). By contrast,
a TIR-containing bacterial effector (HopAM1),
which suppresses plant immune responses in
an NADase-dependent manner (30), did not
induce an ADR1-L1 interaction with the EDS1-PAD4
dimer in this assay (fig. S3). These data
suggest that TIR proteins from unrelated seed 
plant species can directly induce EDS1 dimer- 
hNLR associations. 

We next investigated whether products of 
TIR enzyme activity bind to and support 
EDS1-PAD4 and EDS1-SAG101 interactions 
with hNLRs in vitro. For this, we coexpressed 
RAEP in insect cells. The expressed EDS1-PAD4 
complex was predicted to contain products 
catalyzed by the RPP1 resistosome. We puri- 
fied the EDS1-PAD4 protein and extracted 
the presumed small molecules by denaturing
the complex protein. The extracted small
molecules induced interaction of separately 
purified apo-EDS1-PAD4 with ADR1-L1 (Fig. 
1C and fig. S1C). Similarly, small molecules 
extracted from the RAES-purified and dena- 
tured EDS1-SAG101 stimulated interaction 
of separately purified apo-EDS1-SAG101 with 
NRG1A (Fig. 1C and fig. S1C). These biochemical
data show that enzymatic products of TIR
proteins interact with and facilitate EDS1- 
PAD4- and EDS1-SAG101-specific interac- 
tions with hNLRs. 


Structures of TIR-catalyzed small molecules 
activating EDS1-PAD4 


We next determined the identities of the extracted
small molecules promoting the EDS1-PAD4
interaction with ADR1-L1 using liquid
chromatography coupled with high resolution
mass spectrometry (LC-HRMS). Two singly
charged ions with mass-to-charge ratios (m/z)
558.0657 (z = 1−) and 638.0331 (z = 1−) were
observed in the compounds extracted from 
RAEP but not control samples (fig. S4A). To 
determine the chemical structures of these 
two compounds, we purified EDS1-PAD4 by 
coexpressing RAEP in insect cells, crystallized 
the EDS1-PAD4 complex and solved its structure.
The crystal structure of the complex was
solved with molecular replacement using
apo-EDS1-SAG101 (15) as the template, and was
refined to 2.29 Å resolution (Fig. 2A and table S1).

In the final electron density map, there is
a segment of unambiguous electron density
between the partner EP domains that remains
unoccupied after modeling of EDS1 and PAD4
(Fig. 2B). The ion feature with m/z 558.0657
(z = 1−) from LC-HRMS suggested adenosine
diphosphate ribose (ADPR, molecular mass
559.0717; 2.4 ppm mass shift, within 3 ppm)
as one of the small molecules. However, ADPR
did not match the unfilled electron density,
suggesting that the small molecule bound by
EDS1-PAD4 is an ADPR isomer or isomer
derivative. This is consistent with LC-HRMS
analysis showing that ADPR had a retention
time different from that of the small molecule
with m/z 558.0657 (z = 1−) (fig. S4B). One ADPR
isomer, pRib-AMP, fits the electron density
well except where a small portion connecting
the terminal phosphate group remains void
(fig. S5A). The ion feature with m/z 638.0331
(z = 1−), with a mass shift of 79.967 from m/z
558.0657 (z = 1−), suggested a molecule with
an additional phosphate group on pRib-AMP.
Adding one phosphate group to the terminal
void density produced pRib-ADP, which fits
into the electron density map well (Fig. 2B).
Fig. 2. Structural determination of the EDS1-PAD4 complex with bound
pRib-ADP. (A) Crystal structure of EDS1-PAD4 bound by an RPP1-catalyzed
small molecule. Highlighted within the red frame is omit electron density
(2Fo-Fc contoured at 1.5 sigma) between the EP domains of EDS1 and PAD4.
(B) Close-up view of the electron density highlighted in (A) with modeled
pRib-ADP (in stick). (C) Mass analysis of small molecules bound to EDS1-PAD4.
Top: MS/MS spectra after CID fragmentation of the ion with m/z = 638.0295
(z = 1−). Bottom: Scheme showing the proposed routes of formation of various


A higher pRib-ADP affinity for EDS1-PAD4 
might explain why pRib-ADP was captured 
in the crystal structure of EDS1-PAD4. 
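
The mass arithmetic behind these assignments can be checked with a short calculation. The following is a minimal sketch, not the authors' analysis pipeline: it assumes the ADPR formula C15H23N5O14P2, assumes that the observed ions are deprotonated [M-H]− species, and models the added phosphate as one HPO3 unit. The function and variable names are illustrative only.

```python
# Monoisotopic atomic masses in unified atomic mass units (standard values).
ATOMIC_MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740,
               "O": 15.9949146, "P": 30.9737620}
PROTON_MASS = 1.00727647  # assumption: ions are [M-H]-, so one proton is removed


def monoisotopic_mass(formula):
    """Sum monoisotopic masses for a formula given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def ppm_error(observed_mz, theoretical_mz):
    """Relative mass error in parts per million."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


# Assumed formulas: ADPR as C15H23N5O14P2; one phosphate unit added as HPO3.
adpr_mass = monoisotopic_mass({"C": 15, "H": 23, "N": 5, "O": 14, "P": 2})
hpo3_mass = monoisotopic_mass({"H": 1, "P": 1, "O": 3})

# Observed m/z values quoted in the text (z = 1-).
observed_small, observed_large = 558.0657, 638.0331

adpr_anion_mz = adpr_mass - PROTON_MASS
adpr_ppm = ppm_error(observed_small, adpr_anion_mz)
observed_shift = observed_large - observed_small

print(f"ADPR neutral monoisotopic mass: {adpr_mass:.4f}")
print(f"ADPR [M-H]- theoretical m/z:    {adpr_anion_mz:.4f}")
print(f"Mass error of 558.0657 ion:     {adpr_ppm:.2f} ppm")
print(f"Observed shift between ions:    {observed_shift:.4f}")
print(f"HPO3 monoisotopic mass:         {hpo3_mass:.4f}")
```

Under these assumptions the smaller ion lies within a few ppm of deprotonated ADPR, and the difference between the two ions is close to the mass of one HPO3 unit. Because mass alone cannot distinguish ADPR from its isomers, the retention-time and electron-density evidence described above is still needed to assign pRib-AMP.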

To verify the assignment of pRib-AMP and 
pRib-ADP in the EDS1-PAD4 complex, standard 
compounds were chemically synthesized (see 
data S1). MS/MS spectra and chromatographic 
elution profiles confirmed the identification of 
pRib-AMP and pRib-ADP as EDS1-PAD4 lig- 
ands (Fig. 2C and fig. S5, B to D). We tested 
whether synthetic pRib-AMP and pRib-ADP 
induce EDS1-PAD4 and EDS1-SAG101 interactions,
respectively, with ADR1-L1 and NRG1A.
Both compounds promoted EDS1-PAD4 interaction
with ADR1-L1, similar to that detected
after coexpression of ADR1-L1 with RAEP (Fig.
2D, top). The small molecules were much less
efficient in inducing the EDS1-SAG101 interaction
with NRG1A (Fig. 2D, bottom). The
synthetic pRib-AMP also promoted the EDS1-PAD4
interaction with ADR1-L1 expressed in
Nb tobacco after pRib-AMP addition to leaf
total extracts (fig. S5E). These structural and
biochemical data demonstrate that pRib-AMP 
and pRib-ADP are TIR-catalyzed metabolites 


Huang et al., Science 377, eabq3297 (2022) 


that preferentially induce the EDS1-PAD4 interaction
with ADR1-L1.


Recognition mechanism of pRib-ADP by 
EDS1-PAD4 


In the crystal structure, pRib-ADP adopts 
an extended conformation interacting with 
a pocket formed by the EDS1 and PAD4 EP 
domains (Fig. 3A). In support of the structural 
observation, the pRib-ADP binding pocket 
partially overlaps with positively charged EP- 
domain surfaces of EDS1 and PAD4 that are 
indispensable for EDS1 immunity signaling 
(15, 16, 20). pRib-ADP binding results in com- 
plete burial of the small molecule except for 
its terminal phosphate group (Fig. 3A). An 
extensive network of hydrogen bonds, includ- 
ing those mediated by several well-ordered 
water molecules, appears to govern binding 
of pRib-ADP to the pocket (Fig. 3B), indicat- 
ing specific recognition of pRib-ADP by EDS1- 
PAD4. The 5-phospho-beta-D-ribosyl moiety 
of pRib-ADP is recognized by EDS1 residues 
Asp*®®, Asn*”, and Tyr*”’. The phosphate 
group of this moiety also forms salt bridges 
ions shown above. Numbers in parentheses indicate theoretical molecular
weights of proposed ions. (D) Synthetic pRib-AMP and pRib-ADP induce a strong
EDS1-PAD4 interaction and a weak EDS1-SAG101 interaction with ADR1-L1 and
NRG1A, respectively. GS4B beads bound by GST-ADR1-L1 or GST-NRG1A were
incubated with excess purified apo-EDS1-PAD4 or apo-EDS1-SAG101 and
synthetic pRib-AMP and pRib-ADP on ice for 30 min. After washing, proteins
bound to GS4B beads were eluted, analyzed by SDS-PAGE, and detected by
Coomassie brilliant blue staining.


with Arg?" and Arg*”® and van der Waals 
contact with Phe®®” of PAD4 (Fig. 3B). Thus, 
both EDS1 and PAD4 make substantial con- 
tributions to recognition of the ADP moiety of 
pRib-ADP. The adenine group stacks against 
PAD4: Tyr®*? at one side and EDS1 Tyr*” at the 
other, and is further stabilized by hydrogen 
bonds with EDS1 Thr**”. Besides making van 
der Waals contacts with PAD4 Lys®’/Lys®®°, 
the ADP moiety ribose group hydrogen bonds 
with EDS1 Arg”? directly and through a water 
molecule. The two phosphate groups of the 
ADP moiety form in total eight hydrogen 
bonds with their neighboring residues - five 
with EDS1 and three with PAD4 (Fig. 3B, 
right). Structure-based sequence alignments 
indicated that EDS1 and PAD4 residues in- 
volved in recognition of pRib-ADP are highly 
conserved in different plant species (fig. S6), 
suggesting that pRib-AMP and pRib-ADP 
binding is an evolutionarily conserved activ- 
ity of EDS1-PAD4. A cyclic ADPR is less likely 
to fit in the pRib-ADP-binding pocket in EDS1- 
PAD4, although the precise chemical struc- 
ture of v-cADPR (12) remains undefined. In 
Fig. 3. Specific recognition of pRib-ADP by EDS1-PAD4. (A) Close-up view of the pRib-ADP-binding pocket
of EDS1-PAD4. EDS1-PAD4 is shown in electrostatic surface and pRib-ADP in stick. (B) Left: EDS1-PAD4 
EP-domain amino acid coordinates for pRib-ADP. Dashed lines represent polar interactions. Purple spheres 
represent water molecules. Right: Diagram showing the interaction between EDS1-PAD4 and pRib-ADP. 
Cyan and purple frames represent residues from EDS1 and PAD4, respectively. (C) Mutagenesis analysis of 
the EDS1-PAD4 pRib-ADP binding pocket. Assays were performed as described in Fig. 1A. 


Arabidopsis, mutations in EP-domain surface 
residues EDS1 Arg*®?, Lys* and PAD4 Arg”, 
Lys*°° identified here as contacts for pRib-ADP 
binding, did not disrupt heterodimer forma- 
tion but abolished TNL effector-triggered and 
basal immunity (16, 18, 20). Also, mutation of
PAD4 Arg” disabled TNL-induced EDS1-PAD4 
association with ADR1 in Arabidopsis (20). 

To verify the pRib-ADP interaction with 
EDS1-PAD4 observed in the crystal structure, 
we engineered amino acid substitutions at the 
pRib-ADP binding pocket between EDS1 and 
PAD4 and tested their effects on the EDS1- 
PAD4 interaction with ADR1 induced by the 
RPP1 resistosome in insect cells. Mutations of
positively charged residues EDS1 Are*®?, PAD4 
Are?" and PAD4: Lys**° coordinating the phos- 
phate group of the pRib moiety had little effect 
on EDS1-PAD4 dimer formation but markedly 
reduced the EDS1-PAD4 interaction with ADR1-L1
(Fig. 3C). When combined with EDS1 R425A,
EDS1 K478A, or EDS1 T492Y, mutation of 
PAD4 Tyr®®’ stacking the adenosine moiety 
resulted in a complete loss of the interaction. 


pRib-AMP and pRib-ADP accumulation 
in plants 


We investigated whether activation of TIR- 
containing proteins leads to pRib-AMP and 
pRib-ADP production in plants. Arabidopsis
TIR-only protein RBA1 (31) was used because
it promotes the EDS1-PAD4 interaction with
ADR1-L1 in an Nb tobacco transient coexpression
assay (fig. S7) (26). LC/MS detection of
these two compounds upon RBA1 expression
in a signaling-defective Nb tobacco epss line
lacking all EDS1-family proteins (Nb epss) (16)
did not produce clear signals, possibly due to 
pRib-AMP and pRib-ADP instability in leaves. 
Indeed, LC/MS assays showed that ~50% of 
pRib-AMP and >90% of pRib-ADP was de- 
graded within 1 hour of mixing with Nb epss 
tissue lysate (fig. S8). Because binding to EDS1- 
PAD4 might protect pRib-AMP and pRib-ADP 
from hydrolysis, Nb epss leaves expressing 
RBA1 were co-sonicated with insect cells
expressing EDS1 and PAD4 to detect
EDS1-PAD4-captured pRib-AMP and pRib-ADP with
LC/MS. Plants expressing wild-type (WT) RBA1
produced strong pRib-AMP signals in the assay 
(Fig. 4 and fig. S9). By contrast, expression of 
an RBA1 NADase catalytic mutant (E86A) 
substantially reduced pRib-AMP accumula- 
tion (Fig. 4 and fig. S9). We failed to observe 
accumulation of pRib-ADP under the same 
conditions. This unstable small molecule might 
need to bind to EDS1-PAD4 immediately upon 
generation to evade degradation in plants. 
These data show that pRib-AMP produced in
TIR-activated plant tissues binds to the EDS1- 
PAD4 complex. 


Allosteric activation of EDS1-PAD4 by 
pRib-AMP and pRib-ADP 

Having established that pRib-AMP and pRib- 
ADP are ligands of EDS1-PAD4, we examined 
the receptor complex activation mechanism 
by these two compounds. For this, we purified 
the apo-EDS1-PAD4 heterodimer from insect 
cells and solved a cryo-electron microscopy 
(cryo-EM) structure of the complex at 2.93-Å
resolution (Fig. 5A, fig. S10, and table S2). The 
EDS1 structure is nearly identical to that in 
the crystal structure of pRib-ADP-bound 
EDS1-PAD4 (fig. S11A). As predicted, the 
structures of the PAD4 N-terminal lipase-like 
and C-terminal EP domains largely resemble 
their SAG101 counterparts (fig. S11B), although 
relative positioning of the two structural do- 
mains is substantially different (fig. S11C). 
Structure alignment between the free and 
pRib-ADP-bound forms of EDS1-PAD4 reveals 
that PAD4 undergoes a marked conformation 
change in its C-terminal EP domain upon pRib- 
ADP binding (Fig. 5, B and C). Compared with 
the PAD4 apo form, the PAD4 EP domain in 
its ligand-bound state is rotated ~10 degrees 
counterclockwise (viewed from the bottom) 
(Fig. 5C). This structural change leads to dis- 
solving of the central part of a long helix 
connecting the PAD4 lipase-like and EP do- 
mains, which in turn allows movement of the 
C-terminal half of the helix toward EDS1 
(Fig. 5, A and B). Similar conformation changes 
occur at the PAD4 short α-helix (α15) and the
N-terminal portion of α17. Collectively, these
changes result in the formation of the pRib- 
ADP-binding pocket between the EP domains 
of PAD4 and EDS1 (Fig. 5B), and suggest that 
an induced-fit mechanism underlies small- 
molecule binding. Except for the terminal 
phosphate group, pRib-ADP becomes sol- 
vent inaccessible after binding to EDS1-PAD4 
(Fig. 3A). Therefore, the small molecule is un- 
likely to be directly involved in the interaction 
with ADR1-L1 but rather mediates confor- 
mational changes in the PAD4 EP domain, 
enabling ADR1 binding to EDS1-PAD4. We 
conclude that pRib-AMP or pRib-ADP binding 
allosterically activates the EDS1-PAD4 recep- 
tor complex to interact with ADR1-family 
hNLRs. On the basis of our data, we propose a 
model in which TIR-generated pRib-AMP and 
pRib-ADP induce activation of EDS1-PAD4 
to allosterically promote its association with 
ADR1 and plant immunity (fig. S12A). 


Discussion 


Plant TIR NADase-generated small molecule 
signaling intermediates proposed to activate 
immune responses have been elusive. One 
challenge has been to develop an in vitro 
Fig. 4. ... in Nb tobacco. (A) MRM analyses of pRib-AMP in Nb tobacco. Nb epss leaves
expressing RBA1 or catalytic mutant RBA1 E86A and insect cells expressing
apo-EDS1-PAD4 complex were mixed and co-sonicated. The EDS1-PAD4
complex in the lysate was purified through affinity chromatography, and LC/MS
was used to analyze small molecules bound in the purified EDS1-PAD4 complex.
The left two panels are MRM chromatograms for ADPR and synthetic pRib-AMP,
respectively. The right two panels are representative chromatograms of
RBA1 and RBA1 E86A, respectively. Black, purple, and blue lines indicate the transitions
560.10>136.20, 560.10>348.00, and 560.10>97.20, respectively. (B) Relative
change in pRib-AMP accumulation in Nb epss plants expressing RBA1 or
RBA1 E86A. The statistics were based on LC-MS results shown in (A) and fig. S9.
The amount of pRib-AMP in RBA1 E86A-expressing Nb epss was normalized
to 1.0. Significance of the difference was determined using Student's t test
(n = 3, **P < 0.01).

Fig. 5. Allosteric activation of EDS1-PAD4 by
pRib-AMP and pRib-ADP. (A) Cryo-EM structure of
apo-EDS1-PAD4. PAD4 lipase-like and EP domains
are shown in green and light orange, respectively.
(B) Structural alignment between apo- and
pRib-ADP-bound forms of EDS1-PAD4. Highlighted with an
open red frame is the pRib-ADP-binding pocket
between the EDS1 and PAD4 EP domains. EDS1
and PAD4 in the ligand-bound form are shown in
cyan and purple, respectively. EDS1 in
apo-EDS1-PAD4 was used as the template for structural
alignment. Color codes for the apo-EDS1-PAD4
are the same as those in (A). The secondary
structure elements labeled in white and red are from
ligand-bound and apo-EDS1-PAD4, respectively.
(C) Two orientations showing the structural
alignment of the EP domains of PAD4 in apo- and
ligand-bound forms of EDS1-PAD4. Left: the
orientation is the same as that shown in (B). Right:
bottom view of the structural alignment. Color codes
for the apo- and pRib-ADP-bound forms of
EDS1-PAD4 are the same as those in (B). The arrow shows
that the PAD4 EP domain in its ligand-bound state
rotates ~10° counterclockwise compared with the
apo-PAD4 EP domain.

biochemical assay that detects TIR-catalyzed 
bioactive molecules. Here, we reconstituted 
EDS1 heterodimer associations with hNLRs 
induced by different TIR-containing proteins 
in insect cells (Fig. 1, A and B, and fig. S3). 
Expression of Arabidopsis TIR-only protein 
RBA1 in Nb tobacco promoted the accumulation
of pRib-AMP in a catalytic activity-dependent
manner (Fig. 4, A and B). These
results identify pRib-AMP and pRib-ADP as 
potentially conserved small molecules for EDS1 
signaling in seed plants. This is supported by 


29 July 2022 


the high degree of conservation of pRib-ADP- 
coordinating residues in EDS1 and PAD4 
(fig. S6). Triggering of EDS1-dependent cell
death in Nb tobacco plants by a monocot
TIR-only protein BdTIR (12), along with
BdTIR activity promoting EDS1-PAD4-ADR1
interactions in insect cells (fig. S3), strengthens
the notion that TIRs might produce pRib-AMP 
and pRib-ADP to activate the EDS1-PAD4- 
ADR1 immune signaling branch in diverse 
seed plant species. 

The terminal phosphate group of pRib-ADP 
is solvent exposed when bound to EDS1-PAD4 
(Fig. 3A). It is therefore possible that pRib- 
ADP derivatives with covalent modifications 
to this phosphate group are also recognized by 
EDS1-PAD4. Biochemical and structural data 
reveal that Arabidopsis EDS1-PAD4 functions 
as a receptor complex for pRib-AMP and pRib-ADP
(Figs. 1 and 3). Supporting the biological
significance of pRib-AMP and pRib-ADP rec- 
ognition by EDS1-PAD4, mutations disrupting 
pRib-AMP and pRib-ADP binding (Fig. 3C) 
suppress EDS1-PAD4-mediated pathogen re- 
sistance (15, 18, 20). pRib-AMP and pRib-ADP 
binding induces conformational changes in 
EDS1-PAD4 to allosterically promote its association
with ADR1-L1 (Fig. 5). The above characteristics
lead us to propose that TIR-catalyzed
products pRib-AMP and pRib-ADP act as sec- 
ond messengers in plant immunity signaling. 
The mode of action of these two compounds is 
reminiscent of cyclic nucleotide-based second 
messengers such as cyclic GMP-AMP (cGAMP) 
in animals (32) (fig. S12B). 

In addition to mediating TNL receptor sig- 
naling, the Arabidopsis EDS1-PAD4 dimer pro- 
motes defense reprogramming downstream 
of a non-TNL-type of intracellular NLR RPS2 
(18, 33) and cell-surface immune receptor 
RLP23 (14, 34, 35), through what we identify 
here as the pRib-AMP and pRib-ADP binding 
pocket. Thus, pRib-AMP and pRib-ADP likely 
have a broad role in biotic stress signaling, 
consistent with the molecules being second 
messengers (72, 35). In further comparison to 
well-established second messengers such as 
phosphatidylinositol 3,4,5-trisphosphate
(PIP3) and cADPR (36), pRib-AMP and pRib-ADP
convert externally generated stimuli (pathogen
effectors) to intracellular Ca²⁺ signals,
potentially by activating the Ca²⁺-permeable
channel of ADR1 (27). Second messenger sig- 
naling is generally terminated by metabolic 
enzymes (37). Although the processes that re- 
move pRib-AMP and pRib-ADP are not known, 
these molecules are rapidly degraded in Nb 
tobacco extracts (fig. S8).

The current data show that TIR-containing 
proteins catalyze the production of pRib-AMP 
and pRib-ADP, which directly activate the 
EDS1-PAD4 dimer. TIR-containing proteins 
can also encode a 2',3'-cAMP/cGMP synthetase
activity promoting EDS1-dependent cell
death (38). Thus, plant TIR-containing proteins 
appear to constitute a large family of enzymes 
producing noncanonical and nucleotide-based 
signaling molecules for regulation of immune 
and possibly other stress responses. We previously
proposed that 2',3'-cAMP/cGMP facilitate
self-amplification of EDS1 signaling (38).
An outstanding question is how TIR-generated 
pRib-AMP and pRib-ADP or their derivatives 
intersect with 2',3'-cAMP/cGMP in driving the 
plant immune response. 


Materials and methods summary 


RPP1_WsB with a C-terminal twin-StrepII tag, 
ATR1_Emoy2 with a C-terminal 10xHis tag, 
nontagged EDS1, PAD4 with a C-terminal 
twin-StrepII tag (collectively called RAEP), 
and ADR1-L1 with an N-terminal GST tag
were coexpressed in Sf21 insect cells. The co- 
expressed proteins were purified using gluta- 
thione sepharose 4B beads, and proteins bound 
to the beads were eluted with glutathione. 
Similar methods were used to reconstitute the 
EDS1-SAG101 interaction with NRG1A.

The apo-EDS1-PAD4 complex protein was 
expressed using the above constructs in insect 
cells, purified using streptactin beads, and fur- 
ther cleaned by size exclusion chromatogra- 
phy. The purified apo-EDS1-PAD4 protein 
was used for cryo-EM analysis. EDS1-PAD4 
purified by coexpression of RAEP was used 
for crystallization, which was performed using 
hanging-drop vapor diffusion. All diffraction 
datasets of crystals of the protein complex 
were collected at Shanghai Synchrotron Radi- 
ation Facility (SSRF) and processed using the 
HKL2000 software package. The crystal structure
of the EDS1-PAD4 complex was determined
by molecular replacement and refined with
Phenix.

Cryo-EM datasets of the apo-EDS1-PAD4 
complex were collected in counted superresolution
mode on a Titan Krios3 (FEI) electron
microscope. The contrast transfer function was
corrected using CTFFIND4. Further process- 
ing was performed using RELION-3.1. The 
final resolution was defined on the basis of 
gold standard FSC 0.143 criteria. Local resolu- 
tion distribution was evaluated using RELION. 
Models were refined using Phenix, and model 
validation was performed with MolProbity 
and EMRinger. 
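
The gold-standard FSC 0.143 criterion mentioned above can be illustrated with a small calculation. This is a minimal sketch, not RELION's implementation: it assumes a Fourier shell correlation curve sampled at increasing spatial frequencies (in 1/Å) and reports the resolution at which the curve first drops below the threshold, using linear interpolation. The function name and the sample curve are hypothetical and are not data from this study.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Return the resolution (in Å) where the FSC curve first falls below
    the threshold, interpolating linearly between neighboring shells.
    Returns None if the curve never crosses the threshold."""
    points = list(zip(spatial_freqs, fsc_values))
    for (f0, c0), (f1, c1) in zip(points, points[1:]):
        if c0 >= threshold > c1:
            # Linear interpolation of the crossing frequency within this shell pair.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None


# Hypothetical FSC curve: spatial frequency (1/Å) and correlation per shell.
example_freqs = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40]
example_fsc = [1.00, 0.99, 0.97, 0.90, 0.70, 0.40, 0.20, 0.05]

example_resolution = resolution_at_threshold(example_freqs, example_fsc)
print(f"Estimated resolution: {example_resolution:.2f} Å")
```

In this made-up curve the crossing falls between the shells at 0.35 and 0.40 1/Å, so the reported resolution lies between 2.5 and about 2.86 Å.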

Small molecules bound in EDS1-PAD4 were 
extracted and analyzed by LC-HRMS. Data de- 
pendent acquisition was applied for MS/MS. 
Data analysis was performed using Xcalibur 
version 4.4 software (Thermo Fisher Scien- 
tific). The LC-HRMS assays of the extracted 
molecules and chemically synthesized pRib- 
AMP and pRib-ADP were performed to deter- 
mine the identities of the small molecules. 

Coexpression of RAEP with WT or mutant 
forms of EDS1 and PAD4 was performed in 
insect cells, and pull-down assays were used to 
assess the impact of mutations on the EDS1-PAD4 
interaction with ADR1-L1. 

To determine the production of pRib-AMP and pRib-ADP in plants, RBA1 WT and an RBA1 catalytic mutant E86A with a C-terminal Myc tag were transformed into Agrobacterium and transiently expressed in a Nicotiana benthamiana CRISPR quadruple eds1a pad4 sag101a sag101b (Nb epss) mutant. Leaf powders of plants expressing RBA1 WT or RBA1 E86A were harvested and mixed with insect cells expressing EDS1 and PAD4. Then EDS1-PAD4 protein was purified using Ni-NTA, and the small molecules bound in the complex were extracted for LC-MS analysis. 

TIR-catalyzed ADP-ribosylation reactions produce 
signaling molecules for plant immunity 

INTRODUCTION: Plants deploy intracellular 
nucleotide-binding leucine-rich repeat (NLR) 
receptors to detect pathogen effectors that are 
delivered to host cells during infection. Effec- 
tor recognition leads to NLR oligomerization, 
which induces effector-triggered immunity 
(ETI), often involving host cell death. The NLR 
receptor subclass called TNL (TIR-NLR) has 
an N-terminal Toll/interleukin-1 receptor (TIR) 
signaling domain. Pathogen effector-activated 
TNLs form tetrameric complexes (resistosomes) 
with nicotinamide adenine dinucleotide hydro- 
lase (NADase) activity encoded in the TIR 
domain. The NADase activity of TNLs or TIR 
domain proteins confers pathogen immunity 
and/or host cell death. Activated TNLs signal 
through conserved lipase-like proteins consist- 
ing of EDS1 (Enhanced Disease Susceptibility 1) 
and its two exclusive partners, PAD4 (Phyto- 
alexin Deficient 4) and SAG101 (Senescence- 
Associated Gene 101), together with a small 
group of conserved coiled-coil domain-containing 
helper (signaling) NLRs. In Arabidopsis, EDS1- 
PAD4 and EDS1-SAG101 dimers cooperate 


with particular helper NLR subgroups, ADR1 
(Activated Disease Resistance 1) and NRG1 
(N requirement gene 1), respectively, to induce 
immune responses. The biochemical mecha- 
nisms underlying TNL and TIR dependence 
on these two EDS1 dimer-helper NLR modules 
remain unknown. 


RATIONALE: In Arabidopsis TNL- or TIR- 
triggered immunity, EDS1-PAD4 dimers as- 
sociate with ADR1-type helper NLRs to restrict 
pathogen growth, whereas EDS1-SAG101 dimers 
interact with NRG1-type helper NLRs to promote 
host cell death. Plant TNLs and TIRs 
catalyze production of several nucleotide-based 
molecules in vivo, which suggests that TIR- 
catalyzed products might activate immune 
outputs of ADR1 and NRG1. Based on similar 
but nonidentical EDS1-PAD4 and EDS1-SAG101 
surface grooves, we hypothesized that EDS1 
dimer binding of TIR NADase-catalyzed 
products induces association with their corresponding helper NLRs. We identified 2′-(5″-phosphoribosyl)-5′-adenosine diphosphate (pRib-ADP) and monophosphate (pRib-AMP) as the TIR-catalyzed bioactive compounds that bind to and induce EDS1-PAD4 interaction with ADR1. However, these molecules have only weak EDS1-SAG101 binding activity, which suggests that different TIR catalytic products activate the EDS1-SAG101-NRG1 immunity branch. 

TIR-catalyzed small molecules controlling two immunity branches. Activated TIRs and TNLs use NAD⁺ or NAD⁺ with ATP as substrates to produce ADPr-ATP and di-ADPR through ADP-ribosylation reactions, which are likely to be further hydrolyzed to pRib-ADP and pRib-AMP. pRib-ADP and pRib-AMP bind specifically to EDS1-PAD4 dimers, and ADPr-ATP and di-ADPR bind specifically to EDS1-SAG101 dimers, triggering conformational changes of PAD4 and SAG101 EP domains to allosterically induce interaction with CNL-type helper NLRs, ADR1 and NRG1, for plant resistance and cell death, respectively. 


RESULTS: We found that coexpression of an 
Arabidopsis TNL (RPP1) resistosome or the 
monocot TIR-only protein from Brachypodium 
distachyon with EDS1, SAG101, and NRG1 induced TNL or TIR NADase-dependent specific interaction between EDS1-SAG101 and NRG1. Coupled with high-resolution mass spectrometry (HRMS) data, a cryo-electron microscopy-generated structure of TNL-activated EDS1-SAG101 revealed that a small molecule, ADP-ribosylated adenosine triphosphate (ADPr-ATP), binds at a pocket similar to that of pRib-ADP and pRib-AMP in EDS1-PAD4, establishing EDS1-SAG101 as a receptor for this small molecule. ADPr-ATP binding to EDS1-SAG101 induces a conformational change in the C-terminal part of SAG101, which allosterically enables its interaction with NRG1. This mechanism is conserved in pRib-ADP- and pRib-AMP-triggered 
EDS1-PAD4 binding to ADR1 and explains the 
recruitment of helper NLR types by their cor- 
responding EDS1 heterodimers. Residues co- 
ordinating small molecule binding in both 
dimers are conserved in seed plant species, 
suggesting broad relevance. TIR activation 
resulted in TIR NADase-dependent accumu- 
lation of ADPr-ATP in plant tissues. ADPr- 
ATP is synthesized by TIR-catalyzed transfer of 
ADP-ribose (ADPR) from NAD⁺ (called ADP- 
ribosylation) to ATP. A related product, ADPr- 
ADPR (di-ADPR), with similar activity in 
inducing EDS1-SAG101 interaction with NRG1 
is formed by ADP-ribosylation of ADPR. Synthesis 
of pRib-ADP and pRib-AMP likely involves 
a two-step mechanism through TIR-catalyzed 
hydrolysis of ADPr-ATP and di-ADPR. 


CONCLUSION: TIR enzyme activity catalyzes 
ADP-ribosylation of ATP and ADPR to produce 
NAD⁺-derived small molecules that activate two 
distinctive EDS1 dimer-helper NLR immunity 
modules. Allosteric activation enables EDS1 
dimer association with its cofunctioning helper 
NLR. The ligands and their receptor mech- 
anisms are likely conserved across seed plants 
to regulate immune responses. 

TIR-catalyzed ADP-ribosylation reactions produce 
signaling molecules for plant immunity 

Plant pathogen-activated immune signaling by nucleotide-binding leucine-rich repeat (NLR) receptors 
with an N-terminal Toll/interleukin-1 receptor (TIR) domain converges on Enhanced Disease 
Susceptibility 1 (EDS1) and its direct partners, Phytoalexin Deficient 4 (PAD4) or Senescence-Associated 
Gene 101 (SAG101). TIR-encoded nicotinamide adenine dinucleotide hydrolase (NADase) produces 
signaling molecules to promote exclusive EDS1-PAD4 and EDS1-SAG101 interactions with helper 
NLR subclasses. In this work, we show that TIR-containing proteins catalyze adenosine diphosphate 
(ADP)-ribosylation of adenosine triphosphate (ATP) and ADP ribose (ADPR) through ADPR polymerase- 
like and NADase activity, forming ADP-ribosylated ATP (ADPr-ATP) and ADPr-ADPR (di-ADPR), 
respectively. Specific binding of ADPr-ATP or di-ADPR allosterically promotes EDS1-SAG101 interaction 
with helper NLR N requirement gene 1A (NRG1A) in vitro and in planta. Our data reveal an enzymatic 
activity of TIRs that enables specific activation of the EDS1-SAG101-NRG1 immunity branch. 


Cell-surface and intracellular immune receptors detect pathogen disturbance and initiate signaling cascades, which lead to host-mediated defenses (1, 2). The Toll/interleukin-1 receptor (TIR) domain is an immune signaling module in many species (2). Bacterial and plant TIRs and human sterile alpha and TIR motif containing 1 (hSARM1) are nicotinamide adenine dinucleotide (NAD⁺)-hydrolyzing enzymes (3-5). In plants, intracellular nucleotide-binding leucine-rich repeat (NLR) receptors recognize pathogen-delivered effectors to confer effector-triggered immunity (ETI) (6). ETI transcriptionally boosts generally weaker basal immunity mediated by cell-surface pattern-recognition receptors conferring pattern-triggered immunity (PTI), often leading to localized host cell death (7, 8). A major pathogen-sensing NLR subclass (called TIR-NLRs or TNLs) has N-terminal TIR signaling domains (2, 5). Also, TIR domain proteins, which lack leucine-rich repeats (TIRs), contribute to ETI and basal immunity responses (5, 9-11). The signaling role of TNLs and TIRs depends on their NAD⁺ hydrolase (NADase) activity (3, 4, 12), which is promoted by effector-induced TNL oligomerization (12, 13) or TIR self-association (3, 4, 14). 
Plant immunity mediated by TNLs and 
TIRs requires the conserved lipase-like protein 
Enhanced Disease Susceptibility 1 (EDS1) and 
its sequence-related direct partners, Phyto- 
alexin Deficient 4 (PAD4) and Senescence- 
Associated Gene 101 (SAG101) (15-17), and two 
subfamilies of coiled-coil (CC)-like N-terminal 
domain helper NLRs, Activated Disease Resist- 
ance 1 (ADR1) and N requirement gene 1 (NRG1) 
(16, 18-22). Studies in Arabidopsis and Nicotiana 
benthamiana (Nb tobacco) have revealed that 
EDS1-SAG101 cooperates with NRG1-type helper 
NLRs as a TNL ETI-induced module promoting 
host cell death (16, 23). Functional cooperation 
and pathogen-induced interaction between the 
Arabidopsis EDS1-PAD4 dimer and ADR1-type 
helper NLRs, by contrast, is associated with 
boosted basal immunity and restriction of 
pathogen growth (16, 18, 23-26). In Arabidopsis, 
these two modules are partially redundant in 
their functions, but they also display preferences 
for specific immune outputs (27). 
TIR-catalyzed small molecules induced the 
assembly of Arabidopsis EDS1-SAG101-NRG1 
and EDS1-PAD4-ADR1 complexes in insect 
cells (28). TNL- and TIR-generated 2′-(5″-phosphoribosyl)-5′-adenosine monophosphate (pRib-AMP) and diphosphate (pRib-ADP) efficiently induced EDS1-PAD4 association with ADR1-L1 but displayed only weak activity in promoting EDS1-SAG101 interaction with NRG1A 
(28). These data and the EDS1-SAG101-NRG1 
signaling preferences in vivo suggested that 
other, yet-unidentified TNL- or TIR-catalyzed 
small molecules underlie signaling through 
this immune complex. We show that TIR domain 
proteins catalyze an ADP ribose-transfer reaction 
using NAD⁺ or NAD⁺ with ATP as substrates 
to produce ADP-ribosylated ADPR (di-ADPR) 
and ADP-ribosylated ATP (ADPr-ATP), respec- 
tively. Like pRib-ADP and pRib-AMP (28), di- 
ADPR and ADPr-ATP can function as second 
messengers to induce TNL signaling. The latter 
two molecules specifically bind to and induce 
Arabidopsis EDS1-SAG101 interaction with 
NRG1A, establishing EDS1-SAG101 as a receptor 
complex for di-ADPR and ADPr-ATP. Structural 
analyses reveal specific recognition mechanisms 
for di-ADPR and ADPr-ATP by EDS1-SAG101. 
Our data show that different small molecules 
catalyzed by TIR proteins selectively potenti- 
ate two distinctive EDS1-mediated signaling 
complexes and shed light on the catalytic 
mechanism underlying production of these 
small molecules. 


Identification of a small molecule specifically 
activating EDS1-SAG101-NRG1 


pRib-ADP and pRib-AMP displayed weak activity in inducing the EDS1-SAG101 interaction with NRG1A (28), which suggests that other TIR-catalyzed small molecules promote this interaction more efficiently. When coexpressed with the Arabidopsis TNL receptor RPP1 and its cognate Hyaloperonospora arabidopsidis effector ATR1 (29), EDS1-SAG101 was copurified with N-terminally GST-fused NRG1A (fig. S1A) (28). NRG1A binding of EDS1-SAG101 was suppressed by mutation of the RPP1 catalytic Glu residue, which suggests that the EDS1-SAG101 interaction with NRG1A is conferred by RPP1-catalyzed products. We denatured the EDS1-SAG101 protein coexpressed with the RPP1 resistosome and extracted EDS1-SAG101-bound small molecules. Although separately purified apo-EDS1-SAG101 failed to interact with NRG1A, the complex gained NRG1A- but not ADR1-L1-binding activity when incubated with the extractant (fig. S1B). Liquid chromatography and high-resolution mass spectrometry (LC-HRMS) analysis of the extractant revealed a chemical species with a mass-to-charge ratio (m/z) of 1047.0530 (z = 1) (Fig. 1A). Additionally, a trace amount of a product with a molecular weight matching that of pRib-ADP was detected (fig. S1C), supporting weak pRib-ADP activity in promoting EDS1-SAG101 association with NRG1A (28). 

Fig. 1. Identification of a TIR-catalyzed small molecule activating EDS1-SAG101. (A) Mass analysis of the small molecule bound to EDS1-SAG101. (Top) MS/MS spectra after collision-induced dissociation (CID) fragmentation of the ion with m/z 1047.0530 (z = 1). (Bottom) Scheme showing the proposed routes of formation of various ions shown above. Numbers in parentheses indicate theoretical molecular weights of proposed ions. (B) Cryo-EM structure of EDS1-SAG101 bound by ADPr-ATP. ADPr-ATP bound between EDS1 and SAG101 EP domains is shown in transparent sphere. EDS1 and SAG101 are shown in cartoon, and ADPr-ATP is shown in stick. (C) Cryo-EM density around the bound ADPr-ATP (in stick). The map was contoured at 8.0 sigma in PyMOL. (D) ADPr-ATP induces EDS1-SAG101 interaction with NRG1A after protein expression in Nb tobacco. ADPr-ATP previously extracted from EDS1-SAG101 was incubated with a lysate of Nb tobacco quadruple mutant eds1pad4sag101ab (Nb-epss) expressing EDS1-FLAG, SAG101-FLAG, and NRG1A E14A/E27A-SH at 4°C for 1 hour. Protein-protein interaction was detected by Co-IP with antibodies as indicated. (E) Relative change in ADPr-ATP accumulation in Nb-epss plants expressing RBA1 WT or RBA1 E86A. The statistics are based on LC-MS results of three replicates. The amount of ADPr-ATP in RBA1 E86A-expressing Nb-epss was normalized to 1.0. Significance calculated with Tukey's HSD (honestly significant difference) test (n = 3, α = 0.05; different letters indicate significant differences). Nb-epss leaves expressing RBA1 WT or the catalytic mutant RBA1 E86A and insect cells expressing apo-EDS1-SAG101 were mixed and cosonicated. The apo-EDS1-SAG101 complex in the lysate was purified through affinity chromatography. Multiple reaction monitoring (MRM) analyses were performed for ADPr-ATP bound in the purified EDS1-SAG101 complex. Significance of the difference was determined using Student's t test (n = 3; **P < 0.01). 

To identify the chemical species with m/z 1047.0530 (z = 1), we solved a cryo-electron microscopy (cryo-EM) structure of EDS1-SAG101 purified after coexpression with RPP1 and ATR1 in insect cells (Fig. 1B, fig. S2, and table S1). A cryo-EM density not occupied by EDS1-SAG101 is clearly discerned at the interface between the C-terminal EP domains of EDS1 and SAG101 (Fig. 1C), corresponding to where pRib-ADP binds in EDS1-PAD4 (fig. S3A) (28). pRib-ADP fitted well into half of the unoccupied cryo-EM density (fig. S3B), but two patches of cryo-EM density remained unfilled after pRib-ADP modeling (fig. S3B). The smaller patch, which connects to the terminal (β) phosphate group of the modeled pRib-ADP, likely results from a phosphate group. The other patch is much larger and accommodates an AMP molecule. The phosphate group, pRib-ADP, and pRib-AMP are subgroups of ADPr-ATP, with a calculated m/z of 1047.0496 for the deprotonated ion, matching the 1047.0530 measured by LC-HRMS (Fig. 1A, top). Furthermore, prominent ions with m/z values expected for product ions of ADPr-ATP were observed in the high-resolution tandem mass spectrometry (MS/MS) spectrum (Fig. 1A, bottom). 
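
The mass assignment above can be checked with a few lines of arithmetic. The sketch below assumes that ADPr-ATP has the elemental composition of ATP plus ADPR minus one water (a condensation product) and that the observed ion is the singly deprotonated species [M−H]⁻; both are assumptions of this illustration, as are the helper names. The monoisotopic element masses are standard values.

```python
# Monoisotopic masses (u) of the most abundant isotope of each element.
MONO = {"C": 12.0, "H": 1.00782503, "N": 14.0030740,
        "O": 15.9949146, "P": 30.9737620}
PROTON = 1.00727646  # mass of H+ (u)

# Neutral, fully protonated elemental compositions.
ATP = {"C": 10, "H": 16, "N": 5, "O": 13, "P": 3}
ADPR = {"C": 15, "H": 23, "N": 5, "O": 14, "P": 2}
WATER = {"H": 2, "O": 1}


def condense(a, b):
    """Composition of a + b after loss of one water (assumed condensation)."""
    combined = {}
    for part in (a, b):
        for element, count in part.items():
            combined[element] = combined.get(element, 0) + count
    for element, count in WATER.items():
        combined[element] -= count
    return combined


def mono_mass(composition):
    """Monoisotopic mass of a neutral composition."""
    return sum(MONO[element] * count for element, count in composition.items())


def mz_deprotonated(composition):
    """m/z of the singly charged [M-H]- ion."""
    return mono_mass(composition) - PROTON


def ppm_error(observed, theoretical):
    """Signed mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


adpr_atp = condense(ATP, ADPR)
theoretical_mz = mz_deprotonated(adpr_atp)
observed_mz = 1047.0530  # value reported in the text
print(adpr_atp)
print(f"theoretical [M-H]-: {theoretical_mz:.4f}")
print(f"mass error: {ppm_error(observed_mz, theoretical_mz):.1f} ppm")
```

Under these assumptions the theoretical value agrees with the reported 1047.0530 to within a few parts per million, which is the usual basis for calling a high-resolution mass match.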

We investigated whether ADPr-ATP extracted and purified as described above induces an EDS1-SAG101-NRG1A complex when the proteins are expressed in Nb tobacco. EDS1, SAG101, and an NRG1A E14A/E27A cell death-inactive variant (23, 30, 31) were transiently coexpressed in an Nb tobacco epss line lacking all EDS1-family proteins (Nb-epss) (16). ADPr-ATP was incubated with the Nb-epss tissue lysate. Incubation with ADPr-ATP resulted in an interaction between EDS1-SAG101 and NRG1A, which was not detected in the absence of ADPr-ATP (Fig. 1D). These results indicate that ADPr-ATP is sufficient to promote EDS1-SAG101 association with NRG1A. 
We next used the Nb-epss transient assay to investigate whether a TIR protein catalyzes accumulation of ADPr-ATP in plants. Arabidopsis RBA1 (9) was used for this because of its EDS1-dependent cell death activity in Nb tobacco (28). Nb-epss leaves expressing RBA1 were cosonicated with insect cells expressing EDS1 and SAG101, and LC-MS was used to detect small molecules captured by purified EDS1-SAG101. As shown in Fig. 1E, plants expressing wild-type RBA1 but not an RBA1 catalytic mutant (E86A) produced strong ADPr-ATP signals in the assay. These data show that TIR-catalyzed ADPr-ATP is produced in plant tissues and binds to EDS1-SAG101. 


Recognition mechanism of ADPr-ATP 
by EDS1-SAG101 


ADPr-ATP adopts an elongated conformation, binding to a positively charged pocket between the EDS1 and SAG101 EP domains (Fig. 2A). The pRib-ADP moiety in ADPr-ATP forms a large network of hydrogen bonding and polar interactions with EDS1-SAG101 (Fig. 2B) that are conserved in the structure of pRib-ADP-bound EDS1-PAD4 (fig. S3C). The remaining portions of ADPr-ATP also establish extensive contacts with EDS1-SAG101. The terminal nucleobase adenine hydrophobically packs against SAG101 Met304 and Leu**® at one side and EDS1 His476 at the other side of the pocket (Fig. 2B). The terminal phosphate group is solvent exposed and only makes van der Waals contact with SAG101 Lys*”. Sequence alignment showed that the ADPr-ATP-interacting residues of EDS1 (fig. S4A) and SAG101 (fig. S4B) are conserved among dicot plant species. 

Fig. 2. ADPr-ATP-specific recognition by EDS1-SAG101. (A) Close-up view of the ADPr-ATP binding pocket of EDS1-SAG101. EDS1-SAG101 is shown in electrostatic surface, and ADPr-ATP is shown in stick. Blue, red, and white represent positive, negative, and neutral surfaces, respectively. (B) Diagram showing coordinates for interaction between EDS1-SAG101 and ADPr-ATP. Cyan and purple frames represent residues from EDS1 and SAG101, respectively. Red dashed lines represent polar interactions. (C) Analysis of EDS1-SAG101 ADPr-ATP binding pocket mutants for induced association with NRG1A. N-terminally GST-tagged NRG1A was coexpressed with nontagged EDS1 and C-terminally Strep-tagged SAG101 (wild-type or mutant forms as indicated), RPP1, and 10xHis-tagged ATR1 in insect cells. GST-NRG1A was purified using GS4B beads. GS4B-bound proteins were eluted, separated by SDS-polyacrylamide gel electrophoresis (SDS-PAGE), and detected by Coomassie brilliant blue staining. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 

To verify the ADPr-ATP-activated EDS1-SAG101 cryo-EM structure, we assessed the impact of mutations at the ADPr-ATP binding pocket on EDS1-SAG101 interaction with NRG1A using the insect cell expression system and pull-down assay described above. Substitution of SAG101 Met304 with its PAD4 equivalent Arg314 substantially weakened EDS1-SAG101 interaction with NRG1A (Fig. 2C), which explains why ADPr-ATP was barely active in stimulating EDS1-PAD4 interaction with ADR1-L1 (fig. S1B). In Arabidopsis, SAG101 Met304 and PAD4 Arg314 were essential for signaling through their respective branches and induced association with corresponding helper NLRs (32). By contrast, the interaction was only slightly affected by substitution of SAG101 Asn380 with its PAD4 equivalent Phe?*” (Fig. 2C). Supporting a specific role for EDS1 His476 in ADPr-ATP recognition, H476Y substantially suppressed AvrRps4-induced cell death but had little effect on AvrRps4-induced immunity in Arabidopsis (16). Although the EDS1 H476Y mutation did not disable EDS1-SAG101 interaction with NRG1A (Fig. 2C), combination of SAG101 N380F and EDS1 H476Y resulted in a loss of NRG1A-binding activity of EDS1-SAG101. An R493A mutation in EDS1 only reduced EDS1-SAG101 interaction with NRG1A (Fig. 2C), in contrast with a loss of pRib-ADP-induced EDS1-PAD4 interaction with ADR1-L1 (28). This would explain why the EDS1 mutation disturbed mainly EDS1-PAD4-mediated pathogen resistance in vivo (16, 25). The difference might be the result of more extensive contacts between ADPr-ATP and EDS1-SAG101, rendering binding of the small molecule less sensitive to mutation at this position. 


Selective EDS1-SAG101 activation mechanism 
by ADPr-ATP 


ADPr-ATP specifically promotes EDS1-SAG101 interaction with NRG1A (fig. S1B). In comparison, pRib-ADP induces EDS1 heterodimer interactions with ADR1-L1 and NRG1A, although with different efficiency (28). pRib-ADP and ADPr-ATP bind to a similar position in the EDS1-PAD4 and EDS1-SAG101 complexes (fig. S3A). The pRib-ADP-contacting residues of PAD4 are largely conserved in SAG101 (fig. S3C), whereas SAG101 residues for interacting with the ADPR moiety of ADPr-ATP are largely variable in PAD4 (fig. S5). Notably, substitutions of SAG101 Met304 and Asn380 with their bulkier equivalents PAD4 Arg314 and Phe?®” result in occlusion of the ADPR-binding site in EDS1-PAD4 (Fig. 3A). The different rotamers of PAD4 Gln!° and its counterpart SAG101 Glu®°’ further sterically block ADPr-ATP from being recognized by EDS1-PAD4. These structural observations provide an explanation for specific recognition of ADPr-ATP by EDS1-SAG101. 

Superposition of an apo-EDS1-SAG101 crystal structure (15) with the cryo-EM structure of ADPr-ATP-bound EDS1-SAG101 revealed that the EDS1 conformation is nearly identical in the two forms of EDS1-SAG101 (Fig. 3B). There are notable structural differences in the SAG101 EP domain but only a slight conformational change in the SAG101 lipase-like domain upon ADPr-ATP binding (Fig. 3B). Compared with the apo form of SAG101, the EP domain in ADPr-ATP-bound SAG101 rotates anticlockwise ~15 degrees (viewed from the bottom) (Fig. 3C). This causes complete melting of α16 and shifts of the N-terminal sides of helices α14 and α15 toward EDS1, whereas SAG101 α11 shifts ~4.0 Å away from EDS1. These structural analyses indicate that ADPr-ATP binding triggers conformational changes in the SAG101 EP domain, which enable EDS1-SAG101 interaction with NRG1A. Like pRib-ADP bound in EDS1-PAD4, ADPr-ATP bound in EDS1-SAG101 is almost completely buried, which indicates that this small molecule is unlikely to participate in direct interaction with NRG1A. Thus, ADPr-ATP and pRib-ADP (28) use a conserved allosteric mechanism to activate EDS1-SAG101 and EDS1-PAD4, respectively. 

Fig. 3. ADPr-ATP selection and activation mechanism of EDS1-SAG101. (A) Structural alignment of EDS1-SAG101 (top) and EDS1-PAD4 (bottom; PDB code: 7XEY) around the ligand-binding sites. EDS1 (cyan), SAG101 (light orange), and PAD4 (pink) are shown in cartoon, and ADPr-ATP (yellow) and pRib-ADP (blue) are shown in stick. Three PAD4 residues, Phe®*’, Arg314, and Asn*!®, not conserved in SAG101 are shown. (B) Structural alignment of apo-EDS1-SAG101 and ADPr-ATP-bound EDS1-SAG101. Color codes are indicated. Secondary structural elements of SAG101 around the ADPr-ATP-binding site are labeled. (C) Bottom view of the EP domain structural alignment between apo-EDS1-SAG101 and ADPr-ATP-bound EDS1-SAG101 shown in (B). 


TIR-catalyzed di-ADPR induces EDS1-SAG101 
interaction with NRG1A 


Expression of RBA1 resulted in accumulation of ADPr-ATP in Nb tobacco (Fig. 1E). We next investigated whether a purified TIR protein catalyzes production of ADPr-ATP when NAD⁺ is used as a substrate. An isolated RPS4 TNL receptor TIR domain (residues 1 to 236; RPS4-TIR), which causes hypersensitive response (HR)-like cell death when expressed in planta (33), was used because it was easily purified. As shown in Fig. 4A, incubation of EDS1-SAG101 with wild-type RPS4-TIR but not its catalytic mutant E88A in the presence of NAD⁺ induced EDS1-SAG101 interaction with NRG1A. EDS1-SAG101 displayed no NRG1A-binding activity in the absence of NAD⁺. These results suggest that RPS4-TIR-catalyzed ADPr-ATP production supports EDS1-SAG101 interaction with NRG1A. However, LC-HRMS analysis of extracted chemical compounds from the EDS1-SAG101 protein incubated with RPS4-TIR and NAD⁺ revealed a strong peak at m/z 1099.1276 (z = 1) (Fig. 4B) instead of the predicted m/z 1047.0530 (z = 1) of ADPr-ATP. Given its EDS1- 


Jia et al., Science 377, eabq8180 (2022) 29 July 2022 


SAG101-binding activity, the unknown substance with m/z 1099.1276 (z = 1) should be structurally related to ADPr-ATP. One such candidate is di-ADPR, whose calculated m/z for the deprotonated ion (1099.1255) matches the 1099.1276 peak measured by LC-HRMS. Supporting this assignment, prominent ions with m/z values expected for product ions of di-ADPR were observed in the high-resolution MS/MS spectrum (Fig. 4C). The activity of ADPr-ATP and di-ADPR in inducing EDS1-SAG101 interaction with NRG1A suggests that the terminal ribose sugar ring of di-ADPR or the γ phosphate of ADPr-ATP is not essential for EDS1-SAG101 binding. This is consistent with the observation that the γ phosphate is solvent exposed in the cryo-EM structure of EDS1-SAG101 (Fig. 1B). Further supporting this conclusion, a chemically synthesized di-ADPR enantiomer with the configuration of C-1″ from the terminal ribose sugar ring inverted (fig. S6 and data S1) induced the EDS1 heterodimer interaction with NRG1A (Fig. 4D). By contrast, the synthetic compound had little activity in potentiating EDS1-PAD4 binding to ADR1-L1 in the assay (Fig. 4D). 
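
The di-ADPR assignment can be sanity-checked in the same spirit as any accurate-mass match. The sketch below parses an elemental formula, computes the monoisotopic m/z of the singly deprotonated ion, and compares it with the measured peak. The formula C30H44N10O27P4 is an assumption of this illustration (two ADPR units, C15H23N5O14P2 each, joined with loss of one water), and the function names are hypothetical.

```python
import re

# Monoisotopic masses (u) of the most abundant isotope of each element.
MONO = {"C": 12.0, "H": 1.00782503, "N": 14.0030740,
        "O": 15.9949146, "P": 30.9737620}
PROTON = 1.00727646  # mass of H+ (u)


def parse_formula(formula):
    """Turn a simple formula such as 'C6H6N2O' into {element: count}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def mz_deprotonated(formula):
    """m/z of the singly charged [M-H]- ion of a neutral formula."""
    counts = parse_formula(formula)
    neutral = sum(MONO[element] * n for element, n in counts.items())
    return neutral - PROTON


DI_ADPR = "C30H44N10O27P4"  # assumed: 2 x ADPR minus H2O
theoretical_mz = mz_deprotonated(DI_ADPR)
observed_mz = 1099.1276  # peak reported in the text
error_ppm = (observed_mz - theoretical_mz) / theoretical_mz * 1e6

print(f"theoretical [M-H]-: {theoretical_mz:.4f}")
print(f"mass error: {error_ppm:.1f} ppm")
```

A matching accurate mass narrows the candidates but does not by itself fix the structure, which is why the text also relies on the MS/MS product ions and the synthetic compound.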


Mechanism of TIR-catalyzed production of 
bioactive small molecules 


TIR-catalyzed production of ADPr-ATP seemingly results from condensation between ATP and ADPR. However, incubation of RPS4-TIR, ATP, and ADPR failed to promote EDS1-SAG101 interaction with NRG1A (Fig. 4A). This result indicates that neither ADPr-ATP nor di-ADPR was produced in the reaction, and ADPR is unlikely to be the sole RPS4-TIR substrate for di-ADPR production. Given that NADase activity is required for TIR-induced EDS1-SAG101 interaction with NRG1A (Fig. 4A and fig. S1A), we reasoned that formation of the 1″-2′ glycoside bond in ADPr-ATP and di-ADPR involves NAD⁺ hydrolysis. Incubation of an equal molar ratio of ATP and NAD⁺ with RPS4-TIR conferred EDS1-SAG101 NRG1A-binding activity (Fig. 4A). LC-HRMS showed that both di-ADPR and ADPr-ATP were produced in this reaction, with di-ADPR being the major product (fig. S7A). Similar results were obtained when RPS4-TIR was replaced with the RPP1 resistosome purified from insect cells (fig. S8). However, ADPr-ATP was predominantly captured by EDS1-SAG101 (fig. S7B), which suggests a higher EDS1-SAG101 affinity for ADPr-ATP than for di-ADPR. 

Fig. 4. TIR-catalyzed di-ADPR induces EDS1-SAG101 interaction with NRG1A. (A) RPS4 TIR catalyzes different substrates to produce small molecules, inducing EDS1-SAG101 interaction with NRG1A. Reaction products of RPS4 TIR domain (residues 1 to 236) protein with substrates indicated were incubated with GS4B beads bound by GST-tagged NRG1A preincubated with excess apo-EDS1-SAG101 on ice for 30 min. After washing, proteins bound in GS4B beads were eluted, analyzed by SDS-PAGE, and detected by Coomassie brilliant blue staining. (B) LC-HRMS chromatograms of EDS1-SAG101-captured small molecules from RPS4-TIR-catalyzed products with NAD⁺ as the substrate. Purified RPS4-TIR protein was incubated with NAD⁺ at room temperature for 16 hours. Products of the reaction were captured by apo-EDS1-SAG101, and the captured small molecules were extracted by denaturing the complex with 80% methanol (v/v) and then subjected to LC-HRMS. (C) Mass analysis of small molecules in (B). (Top) MS/MS spectra after CID fragmentation of the ion with m/z 1099.1276 (z = 1). (Bottom) Scheme showing proposed routes of formation of various ions shown above. Numbers in parentheses indicate theoretical molecular weights of proposed ions. (D) A synthetic di-ADPR enantiomer induces EDS1-SAG101 interaction with NRG1A. The synthetic compound was incubated with GS4B beads bound by GST-tagged NRG1A and ADR1-L1 preincubated with excess apo-EDS1-SAG101 and apo-EDS1-PAD4 on ice for 30 min. After washing, proteins bound in GS4B beads were eluted, analyzed by SDS-PAGE, and detected by Coomassie brilliant blue staining. 

A crystal structure of RPP1-TIR (residues 83 to 254) solved in the presence of NAD⁺ revealed that the TIR domains form a tetramer that is nearly identical to that in the cryo-EM structure of the RPP1 resistosome (fig. S9 and table S2) (12). Well-resolved electron densities at the two asymmetric dimer interfaces (Fig. 5A), where the ATP molecules are bound in the RPP1 (12) and ROQ1 (13) resistosomes (fig. S9A), match well with ADPR, which indicates that the crystal structure represents an RPP1-TIR product complex. C1″ in the ribose sugar ring of ADPR faces the catalytic Glu residue (Fig. 5B), suggesting that this residue is likely important for catalyzing nicotinamide release from NAD⁺. ADPR binding to the asymmetric RPP1 TIR dimer is markedly similar to that of a substrate mimic to the active site of hSARM1 (fig. S9B) (34). These data suggest that formation of the 1″-2′ glycoside bond in ADPr-ATP involves nucleophilic attack on the C1″ of the TIR-activated NAD⁺ by the 2′-OH of ATP, resulting in ADP-ribosylation of the incoming ATP molecule. ADPR is known to be a product of TIR-catalyzed NAD⁺ hydrolysis (3, 4). Thus, di-ADPR is produced through ADP-ribosylation of ADPR. 
Our previous study had shown that TIR 
proteins can hydrolyze ATP and guanosine 
5’-triphosphate (GTP) (35). We therefore tested 
whether pRib-ADP and pRib-AMP (which 
efficiently activate EDS1-PAD4) (28) can be 
produced through RPS4-TIR hydrolysis of 
ADPr-ATP or di-ADPR (which activate EDS1-SAG101).
NAD⁺, NAD⁺ + ATP, or NAD⁺ + ADP
were incubated with bacteria-purified RPS4-TIR,
and insect cell-purified apo-EDS1-PAD4
was used to capture the products of each reaction.
The EDS1-PAD4-captured molecules
were detected by LC-HRMS. Both pRib-ADP
and pRib-AMP were captured by EDS1-PAD4
when ATP + NAD⁺ were the substrates (Fig. 5C).
By contrast, much lower pRib-ADP amounts
were captured when NAD⁺ alone was the
substrate. Coincubation of NAD⁺ with ADP
notably also promoted production of pRib-ADP 
and pRib-AMP. These data point to a two-step 
reaction in which RPS4-TIR first generates ADPr- 
ATP and di-ADPR through ADP-ribosylation of 
ATP and ADPR and then hydrolyzes the products 
to form pRib-ADP and pRib-AMP (fig. S10).

ribose sugar ring and one RPP1 TIR. Red dashed lines represent polar
interactions. The C-1″ atom is labeled. (C) LC-HRMS chromatograms of


Discussion 

In this work and a related study (28), we 
discover a broadly conserved allosteric mech- 
anism for the activation of EDS1-PAD4 and 
EDS1-SAG101 by TIR-catalyzed small molecules.
The induced conformational change in the two 
EDS1 heterodimers promotes direct association
with cofunctioning helper NLRs, presumably
to promote their Ca²⁺-permeable
ion channel activity (30). Despite the shared 
biochemical activity, the resulting EDS1-PAD4-ADR1
and EDS1-SAG101-NRG1 signaling complexes
display preferences for specific immune
outputs (16, 23, 32). This can be partly reconciled 
by the existence of multiple TIR-catalyzed 
signaling molecules with overlapping and dis- 
tinct binding specificities for the two EDS1 
heterodimers (28) (Fig. 3A). Amino acids that 
influence the selection of these small mole- 
cules by EDS1 heterodimers (Fig. 2 and Fig. 3A) 
and immunity functions in vivo (16, 23, 25, 32) 
are located at similar EDS1-PAD4 and EDS1-SAG101
EP domain surfaces. Notably, an EDS1
mutation (H476Y) that perturbed the ligand 
binding site of EDS1-SAG101 but not EDS1- 
PAD4 (Fig. 2B and Fig. 3A) abolished host 
cell death but had little effect on resistance 
to bacteria in Arabidopsis (16). Thus, specific 
immune functions of the two EDS1 hetero- 
dimers appear to be conferred by their re- 
cognized ligands. This is consistent with the 
observation that overexpression of EDS1 with 
PAD4 leads to increased Arabidopsis patho- 
gen resistance but not host cell death (24). The 
strong activity of pRib-ADP and pRib-AMP in 
promoting EDS1-PAD4 interaction with ADR1,
but a very low capacity for promoting EDS1-SAG101
interaction with NRG1A (28), suggests
that these TIR products might induce effective 
plant immunity without cell death. The iden- 
tified TIR-catalyzed signaling molecules are 
produced with unequal molar ratios (Fig. 5C 
and fig. S7A), probably because of differences 
in their stability (28) and production efficiency. 
In plant tissues, TIR-catalyzed small molecules
might be subject to temporal and spatial regulation
to ensure that one particular set accumulates.
Such regulation might be an important
control mechanism in activating ADR1 or NRG1
or both helper NLRs. Future analysis of small
molecules and their receptor dynamics can help
to develop strategies for the manipulation of
plant immunity.

apo-EDS1-PAD4-captured pRib-ADP (top) and pRib-AMP (bottom) from
RPS4-TIR-catalyzed products. Different colors represent different substrates
of RPS4-TIR. Purified RPS4-TIR protein was incubated with the indicated
substrates at room temperature for 16 hours. Products of each reaction
were captured by apo-EDS1-PAD4, and the captured small molecules
were extracted by denaturing the complex with 80% methanol (v/v) and
subjected to LC-HRMS analyses.

Structure-based sequence alignments indicate
that the ligand binding pockets of EDS1-PAD4
and EDS1-SAG101 are evolutionarily
conserved in different seed plant species (fig. 
S4) (28). A phylogenetic analysis revealed evidence
that EDS1-PAD4 and EDS1-SAG101
have coevolved with ADR1 and NRG1, respectively
(16). Also, a functional incompatibility
was reported between Arabidopsis and Solanaceae
(tomato and Nb tobacco) EDS1-SAG101
and NRG1 proteins (16, 17). Our structural
studies show that pRib-ADP and pRib-AMP 
binding to EDS1-PAD4 and ADPr-ATP and di- 
ADPR binding to EDS1-SAG101 almost com- 
pletely bury these small molecules and the 
conserved ligand recognition pockets (Fig. 1,
Fig. 2, and fig. S3) (28). These observations
indicate that the small molecules do not directly
participate in the induced EDS1-PAD4
interaction with ADR1 or the EDS1-SAG101
interaction with NRG1. Rather, an allosteric
change induced by small molecule binding in 
each dimer probably creates a new protein 
surface for specific helper NLR association. 
Although structural information on the nature 
of EDS1-PAD4-ADR1 and EDS1-SAG101-NRG1 
complexes is needed, we think that ligand- 
induced conformational changes in the PAD4 
or SAG101 EP domains are the most likely 
structural determinants for lineage-specific 
helper NLR interactions. 

Expression of a TIR-only protein from the 
monocot Brachypodium distachyon that lacks 
NRG1 and SAG101 causes cell death in Nb
tobacco (3). Although PAD4 has so far not been 
found to contribute to TNL pathogen resist- 
ance and cell death outputs in Nb tobacco 
(6, 17), these results suggest that the monocot 
TIR protein (BdTIR) catalyzes the production 
of signaling molecules that activate both EDS1 
heterodimers. This was confirmed by our biochemical
assays showing that BdTIR induced
Arabidopsis EDS1-PAD4 interaction with ADR1-L1
(28) and EDS1-SAG101 interaction with
NRG1A (fig. S11) in insect cell expression and
pull-down assays. We speculate that TIR-catalyzed 
production of di-ADPR and ADPr-ATP might 
also be a major synthetic pathway for pRib-ADP 
and pRib-AMP in monocot plants. However, we 
cannot rule out other functions of di-ADPR and 
ADPr-ATP or monocot TIR production of other 
bioactive nucleotide-based molecules. 

The TIR-catalyzed ADPR-transfer reactions 
are reminiscent of ADPR chain extensions 
catalyzed by poly-ADPR polymerases (PARPs) 
(36), which use a strictly conserved glutamate 
residue for the catalysis (36). The conserved 
glutamate residue of TIRs may have a similar 
role in catalyzing ADP-ribosylation of ATP and 
ADPR by activating the C-1″ of NAD⁺ in the
active site (fig. S10). C-1″ of ADPr-ATP in the
cryo-EM structure retains the same configuration
as that of NAD⁺, suggesting that TIR-catalyzed
ADP-ribosylation might involve a
double-displacement catalytic mechanism, as 
shown for hSARM1 (34). However, it is pos- 
sible that TIR-catalyzed ADPr-ATP or di-ADPR 
with an inverted configuration at C-1″ cannot
be captured by EDS1-SAG101 because of their 
deficient or weak EDS1-SAG101-binding acti- 
vity. Thus, the possibility remains that TIR- 
catalyzed ADP-ribosylation of ATP and ADPR 
also involves inversion of the configuration of 
C-1″ at this position, as with PARP-catalyzed
ADP-ribosylation of proteins. Regardless of the 
catalytic mechanism, ADP-ribosylation of ADPR 
and ATP involves both NADase and PARP-like 
activities. In addition to ATP and ADPR, ADP 
can also act as a substrate for a TIR-catalyzed 
ADP-ribosylation reaction, albeit with much
lower efficiency (Fig. 4A and Fig. 5C). This
raises the question of whether TIRs catalyze
ADP-ribosylation of other metabolites. Given 
the high concentrations of ATP in plant cells, 
ADPr-ATP is likely the major TIR product for 
activation of EDS1-SAG101 in the immune response.
Biochemical data suggested that TIR-mediated
hydrolysis of ADPr-ATP and di-ADPR
yields pRib-AMP and pRib-ADP (Fig. 5C and 
fig. S10, A and B), but it remains possible that 
TIRs also use other mechanisms for catalytic 
production of the two compounds in vivo. 

In addition to pRib-ADP, pRib-AMP, ADPr- 
ATP, and di-ADPR, TIRs also catalyze pro- 
duction of adenosine 2',3’-monophosphate 
(2',3'-cAMP) and guanosine 2',3'-monophosphate 
(2',3'-cGMP), which have a critical role in EDS1- 
dependent signaling (35). This is seemingly 
inconsistent with the biochemical data show- 
ing that pRib-AMP and pRib-ADP and ADPr- 
ATP and di-ADPR are sufficient to induce 
EDS1 heterodimer interactions with their co- 
functioning helper NLRs (28). However, it 
should be noted that, in vivo, all components 
have to be in place for ligands to activate re- 
ceptor signaling. An example of this is activa- 
tion of the animal NLR NLRP3, which involves 
two signals—a priming signal for transcriptional 
up-regulation of NLRP3 and pro-interleukin-1β
(IL-1β) and a second signal for induction of
the NLRP3 inflammasome (37). Accumulating 
evidence suggests that pRib-AMP and pRib- 
ADP and ADPr-ATP and di-ADPR might not 
be sufficient to induce EDS1-dependent im- 
munity in planta. Estradiol-inducible expres- 
sion of the pathogen effector AvrRps4 fails 
to induce cell death in Nb tobacco (38), and 
AvrRps4-induced cell death can be potenti- 
ated by PTI signaling (7, 8). Supporting a role 
of PTI in TIR signaling, flg22 (a bacterial 
PAMP) treatment strongly promotes Arabi- 
dopsis TIR protein AtTN3-mediated EDS1- 
dependent cell death in Nb tobacco (39). These 
results suggest that TIR NADase activity is not 
sufficient to induce EDS1-dependent immunity.
RBA1 mutants C83A and KKK“ lacking
the 2',3’-cAMP and 2’,3’-cGMP synthetase but 
retaining the NADase activity fail to elicit cell 
death in Nb tobacco (35). We found that these 
two mutants retained activity in promoting 
EDS1-SAG101 interaction with NRG1A (fig.
S12). This result is consistent with the hypothesis
that TIR-catalyzed products are insufficient
for cell death activation (14). TIR domain-containing
genes that are up-regulated during
early PTI were proposed to produce NADase- 
catalyzed small molecules to boost PTI signaling 
(40). Whether these TIRs and their catalytic 
products have a role in PTI potentiation of ETI 
signaling (7, 8) requires future study. It currently 
remains unknown how 2’,3'-cAMP and 2’,3’- 
cGMP cooperate with pRib-ADP, pRib-AMP, 
ADPr-ATP, and di-ADPR to potentiate EDS1- 
dependent immune responses. We speculate
that 2′,3′-cAMP and 2′,3′-cGMP may up-regulate
the expression of genes required for
EDS1 signaling. This model agrees with a 
recent study showing that 2',3’-cAMP induces 
many stress-related Arabidopsis genes (41). 


Materials and methods summary 


Nontagged EDS1 and C-terminally twin-StrepII
tagged SAG101 were coexpressed in Sf21
cells, and the apo-EDS1-SAG101 complex was
purified using Streptactin beads. The EDS1
and SAG101 constructs were also coexpressed
with RPP1_WsB (C-terminally twin-StrepII
tagged) and ATR1_Emoy2 or RPS4-TIR
(residues 1 to 236) in Sf21 cells, and the EDS1-SAG101
complex was similarly purified. Purified
GST-NRG1A protein expressed in insect cells
was used to pull down the apo-EDS1-SAG101
protein or the EDS1-SAG101 protein coexpressed
with RPP1 and ATR1. Small molecules extracted
from the coexpressed EDS1-SAG101 protein were
incubated with purified apo-EDS1-SAG101 and
GST-NRG1A proteins for pull-down analysis.

For semi-in vitro ADPr-ATP-induced EDS1-SAG101-NRG1
association analysis, EDS1-FLAG,
SAG101-FLAG, and NRGIAE“/¥?4_sH were
coexpressed in Nb-epss lacking EDS1 and its
paralogs. Total protein from the plant materials
was extracted and incubated with ADPr-ATP.
Co-immunoprecipitation (Co-IP) analyses were
conducted to detect EDS1-SAG101 interaction
with NRG1A in the presence or absence of
ADPr-ATP.

EDS1-SAG101 coexpressed with the RPP1 
resistosome was purified and denatured to 
extract the EDS1-SAG101-bound small molecules
for analysis with an orbitrap mass
spectrometer coupled with a Vanquish ultrahigh- 
performance liquid chromatography (UHPLC) 
system. A bridged ethyl hybrid (BEH) amide 
column was used for LC separation. Data- 
dependent MS/MS acquisition (DDA) with 
full scan of m/z 200 to 1500 in negative ion 
mode was performed. For detection of TIR- 
catalyzed ADPr-ATP in Nb tobacco, wild-type 
RBAI(RBAI") and RBAI™* were transiently 
expressed in Nb tobacco epss leaves. Nb tobacco 
epss leaf powders of RBAI“™ or RBAT™** were 
mixed with insect cells expressing EDS1-SAG101 
for sonication. EDS1-SAG101 was purified, the 
purified EDS1-SAG101 complex protein was 
denatured, and the extractant was used for 
LC-MS analysis with an LC MS-8060 triple 
quadrupole mass spectrometer. 

To detect TIR-catalyzed products with different
substrates, purified RPS4-TIR protein
bound in Ni-NTA beads was first incubated
with 1 ml of ddH₂O containing 1 mM NAD⁺,
NAD⁺ + ADP, or NAD⁺ + ATP, and then with
purified apo-EDS1-PAD4 or apo-EDS1-SAG101.
Then, 200 µl of each reaction solution was incubated
with ADR1-L1 or NRG1A for pull-down
analysis using Glutathione Sepharose 4B
(GS4B) beads. The remaining solution of each
reaction was used for analysis with an orbitrap
mass spectrometer.


Cryo-EM data from frozen hydrated grids
of the EDS1-SAG101 protein coexpressed with
the RPP1 resistosome were collected on a Titan 
Krios3 (FEI) with K3 direct detection camera 
(Gatan). Particle picking, two-dimensional (2D) 
classification, 3D classification, and refinement 
were all performed in RELION-3.1. The crystal 
structure of apo-EDS1-SAG101 [Protein Data
Bank (PDB) code: 4NFU] was docked into the 
finally refined 3D reconstruction, and manual 
atomic model building was carried out in Coot. 
phenix.real_space_refine was used to refine the
model of EDS1-SAG101 with the final EM map. 


N-terminally GST tagged RPP1-TIR expressed
in Sf21 insect cells and purified by GS4B resin
was concentrated for crystal screening. The
RPP1-TIR crystals were obtained, and the crystal
structure of RPP1-TIR was determined by molecular
replacement (MR). The model from MR was built
with Coot and subsequently subjected to refinement
by phenix.real_space_refine.


Detailed descriptions of the total synthesis
of the di-ADPR enantiomer can be found in
data S1. All compounds were confirmed by
nuclear magnetic resonance (NMR) spectra
(data S1, NMR spectra of compounds).


Live-cell micromanipulation of a genomic locus
reveals interphase chromatin mechanics


Veer I. P. Keizer, Simon Grosse-Holz, Maxime Woringer, Laura Zambon,
Koceila Aizel, Maud Bongaerts, Fanny Delille, Lorena Kolar-Znika, Vittore F. Scolari,
Sebastian Hoffmann, Edward J. Banigan, Leonid A. Mirny, Maxime Dahan,
Daniele Fachinetti, Antoine Coulon


Our understanding of the physical principles organizing the genome in the nucleus is limited by the lack 
of tools to directly exert and measure forces on interphase chromosomes in vivo and probe their 
material nature. Here, we introduce an approach to actively manipulate a genomic locus using controlled 
magnetic forces inside the nucleus of a living human cell. We observed viscoelastic displacements over 
micrometers within minutes in response to near-piconewton forces, which are consistent with a Rouse 
polymer model. Our results highlight the fluidity of chromatin, with a moderate contribution of the 
surrounding material, revealing minor roles for cross-links and topological effects and challenging the 
view that interphase chromatin is a gel-like material. Our technology opens avenues for future research 
in areas from chromosome mechanics to genome functions. 


Recent progress in observing and perturbing
chromosome conformation has led
to an unprecedented understanding of
the physical principles at play in shaping
the genome in four dimensions (4D)
(1). From genomic loops and topologically associating
domains to spatially segregated A/B
compartments and chromosome territories, 
the different levels of organization of the eu- 
karyotic genome are thought to arise from 
various physical phenomena, including phase 
separation (2-4), ATP-dependent motors (4, 5), 
and polymer topological effects (6). Nonethe- 
less, the physical nature of chromatin and chro- 
mosomes inside the nucleus and its functional 
implications for mechanotransduction remain 
an active area of investigation (7, 8). Observation- 
based studies assessing the mobility of the 
genome in living cells, from single loci (9-11)
and small regions (12) to large domains (13), 
underline the possible existence of different
material states of chromatin (e.g., liquid, solid,
and gel-like). Extranuclear mechanical pertur- 
bations, including whole-nucleus stretching 
(14, 15), micropipette aspiration (16), and ap- 
plication of local pressures (16, 17) or torques 
(18) onto a cell, all affect the overall geometry
of the nucleus and reveal global viscoelastic 
properties. Conversely, intranuclear mecha- 
nical manipulation of the genome is rare and 
technically challenging (8). Viscoelasticity measurements
using a microinjected 1-µm bead
suggested that interphase chromatin may be a
cross-linked polymer network (i.e., a gel) (19).
Recently, intranuclear mechanics were ele- 
gantly probed by monitoring the fusion of 
both artificial (20) and naturally occurring (21)
droplet-like structures. Active mechanical man- 
ipulation of an intranuclear structure was 
recently achieved using an optical tweezer to 
displace a whole nucleolus in oocytes (22) and 
using optically induced thermophoretic flows 
within prophase (23) or interphase (24) nuclei. 
However, these approaches are limited to the 
manipulation of large structures or do not ap- 
ply forces directly on chromatin. These limi- 
tations have made it difficult to disentangle 
various effects (e.g., mechanical response of 
the nucleus versus chromatin itself or hydro- 
dynamics versus polymer viscoelasticity), lead- 
ing to contradictory results. Therefore, an 
approach for the direct and active mechanical 
manipulation of specific genomic loci inside 
living cells is needed. To meet this need, we 
developed a technique for targeted microman- 
ipulation of a specific genomic locus in the 
nucleus of a living cell, allowing us to probe 
the physical properties of an interphase chro- 
mosome by measuring its response to a con- 
trolled point force. 


Results 

Mechanical manipulation of a genomic locus in a 
living cell 

Our approach relies on tethering magnetic
nanoparticles (MNPs) to a genomic locus and
applying an external magnetic field (Fig. 1A).
We chose ferritin MNPs for their small size
(25, 26): 12 nm in diameter for ferritin (PDB
1GWG) and 28 nm for the full MNP (26). We
produced ferritin MNPs by synthesizing in vitro
recombinant enhanced green fluorescent protein
(eGFP)-labeled ferritin cages and loading
them with a magnetic core (see the materials
and methods). We microinjected MNPs into
the nuclei of living human U-2 OS cells previously
engineered to contain an artificial array
inserted at a single genomic location in a
subtelomeric region of chromosome 1 (band
1p36) (27). This genomic array contains ~200
copies of a 20-kb genetic construct, each including
96 tetO binding sites and a transgene.
It has been extensively used in the past to
study the function of several chromatin modifications,
RNA polymerase II (Pol II) recruitment,
and RNA synthesis during induction of
the transgene (27-29). Therefore, although we
used it here uninduced, this array can recapitulate
functional chromatin-based processes
such as transcriptional activation. MNPs were
targeted to the array using a constitutively
expressed fusion protein (TetR, mCherry, and
anti-GFP nanobody) serving as a tether (Fig.
1A). Upon injection, MNPs diffused through
the nucleus and accumulated at the array,
forming a fluorescent spot in both eGFP and
mCherry channels (figs. S1A and S2A). Quantification
of the fluorescence signals indicated
that MNPs were at nanomolar concentrations
in the nucleus after injection and accumulated
at the genomic locus in the range of hundreds
to thousands of MNPs (median 1500 MNPs;
figs. S1B and S3, movie S1, table S1, and materials
and methods). The locus should be regarded
as a condensed and heterochromatic
4-Mb region (1.6% of chromosome 1) residing
in a euchromatic genomic context, as previously
reported (27), with small MNPs (each
being ~2 to 3 times (26) the size of a nucleosome)
sparsely decorating chromatin (1 MNP
per ~2.7 kb). Consistently, we observed that
the locus typically resided in low to intermediate
DNA density regions and was itself relatively
condensed (fig. S4, A and B) and that
binding of MNPs to the locus did not substantially
affect its morphology (fig. S2, A and
B). Microinjection and attraction of unbound
MNPs did not substantially alter chromatin
distribution and densities inside the nucleus
[as assessed by silicon rhodamine (SiR)-DNA
staining; fig. S2, C and D]. Cells were imaged
on a coverglass with custom-made microfabricated
pillars (30, 31) that behave as local
magnets only when subjected to an external
magnetizing field (fig. S5). Therefore, ON/OFF
modulation of the local force field could be
achieved while imaging by placing and removing
an external magnet on the microscope
stage. The shape and orientation of the pillars
were chosen to maximize the magnetic field
gradient and hence the force.

Fig. 1. Mechanical micromanipulation of a genomic locus in living cells.
(A) MNPs of GFP-labeled ferritin were microinjected into the cell nucleus and
targeted to a genomic array containing ~19,000 tetO binding sites (27) with a linker
protein. Cells were imaged on a coverslide with microfabricated magnetic pillars
that produce a local magnetic field and attract the genomic locus. (B) The force
exerted onto the locus depends on its position relative to the pillar and is characterized
using a precalculated force map (see the materials and methods), here shown for
1000 MNPs at the locus. (C) Example of a pull-release experiment showing the locus
being displaced during the 30 min of force exertion and recoiling during the 30 min
of force release (30′-PR scheme). (See also movie S3.) (D) Kymograph of the same
experiment showing each time frame, along with the force time profile calculated using
the force map. (E) Experiment in which pulls and releases are 100 s and the pull-release
cycle is repeated 10x (100″-PR scheme). Images are time projections; shown
in green are all positions of the center of mass of the locus over the periods
represented on the timeline. The arrows indicate the direction of the motion. (See also
movie S4.) (F) Kymograph of the same experiment showing the displacement of
the locus and the spatial patterns of DNA density in the nucleus, along with the force
time profile. All SiR-DNA images are band-pass filtered (see the materials and
methods). For (D) and (F), dotted lines indicate the nuclear periphery and white arrows
features of interest in the spatial distribution of DNA density.

We performed
magnetic simulations and experimental calibrations
using two independent methods (see
the materials and methods, figs. S6 and S7,
and movie S2) to determine the magnitude
and orientation of the force applied onto the
genomic locus as a function of the number of
MNPs bound to it and its position relative to
the magnetic pillar (Fig. 1B). The typical forces
applied onto the locus were in the sub-piconewton
(pN) range, occasionally reaching
a few piconewtons (table S1; median force =
0.45 pN). These values are in the range of
forces exerted by molecular motors in the nucleus,
e.g., comparable to the stalling force of
~0.5 pN for the structural maintenance of
chromosomes (SMC) complex condensin (32)
and a few piconewtons for Pol II (33).
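
The array figures quoted in this section (array size, number of tetO sites, MNP spacing, and the fraction of chromosome 1) can be cross-checked with simple arithmetic. The sketch below is only a sanity check on the rounded values given in the text; the constant and function names are illustrative and do not come from the study.

```python
# Back-of-the-envelope check of the array numbers quoted in the text.
# All inputs are the rounded values from the text; names are illustrative.
COPIES = 200              # approximate number of construct copies in the array
CONSTRUCT_KB = 20         # size of one construct, in kb
TETO_PER_COPY = 96        # tetO binding sites per construct
MEDIAN_MNPS = 1500        # median number of MNPs bound at the locus
FRACTION_OF_CHR1 = 0.016  # locus size as a fraction of chromosome 1 (1.6%)


def array_summary():
    """Return derived quantities for the tetO array."""
    size_kb = COPIES * CONSTRUCT_KB
    size_mb = size_kb / 1000
    return {
        "array_size_mb": size_mb,                        # total array length
        "teto_sites": COPIES * TETO_PER_COPY,            # total tetO sites
        "kb_per_mnp": size_kb / MEDIAN_MNPS,             # mean MNP spacing
        "implied_chr1_mb": size_mb / FRACTION_OF_CHR1,   # chromosome 1 length
    }


if __name__ == "__main__":
    for key, value in array_summary().items():
        print(f"{key}: {value:.1f}")
```

With these inputs the array comes to 4 Mb, 19,200 tetO sites (quoted as ~19,000 in Fig. 1A), about 2.7 kb per MNP, and an implied chromosome 1 length of about 250 Mb.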


Force-induced movement of a genomic locus 
reveals viscoelastic properties of chromatin 


We first applied the magnetic force for 30 min 
and released it for another 30 min while per- 
forming low-illumination three-dimensional 
imaging with a 2-min interval (30'-PR scheme). 
We observed a clear motion of the locus 
toward the magnet upon application of the 
force and a slow and partial recoil when the 
force stopped (Fig. 1, C and D, and movie S3).
This indicates that a sub-piconewton force, 
when applied in a sustained and unidirectional 
manner on a genomic locus, elicits a displace- 
ment of that locus by several micrometers over 
minutes. It also shows that the chromosomal 
locus can move across the nuclear environ- 
ment, which is believed to be crowded and 
entangled. We also applied the force period- 
ically, pulling for 100 s, releasing for 100 s, and 
repeating this cycle 10 times (100″-PR scheme)
while performing fast two-dimensional imag- 
ing with a 5-s interval (Fig. 1, E and F, and 
movie S4). Several observations from these 
two experiments hinted at the material prop- 
erties of chromatin. First, the trajectories 
showed recoils during release periods and a 
gradual slowdown during both pulls and re- 
leases, characteristic of a viscoelastic material. 
Second, spatial heterogeneities in the trajecto- 
ries were visible and appeared to relate to the 
spatial distribution of DNA density (the motion 
of the locus was occasionally hindered where 
the DNA density varied; Fig. 1, D and F, white 
arrows). Third, recoil after force release was 
seen even after collision with the nuclear peri- 
phery, indicating that the material there (peri- 
pheral heterochromatin and the nuclear lamina) 
was not sticky enough to fully retain the locus. 
Fourth, the spatial distribution of DNA density 
in the nucleus did not show large-scale de- 
formations, indicating that the locus did not 
drag along large amounts of material (movies 
S3 and S4). The force-induced displacements
that we observed are consistent with visco- 
elastic and nonconfining chromatin and con- 
stitute a basis to further develop and test 
physical models of interphase chromosomes. 


Quantitative force-response and scaling laws of 
interphase chromatin mechanics 


To quantify the viscoelastic properties of chro- 
matin, we analyzed the trajectories of the 
locus in 35 cells undergoing the 30’-PR scheme 
(corrected for cell motion and force orienta- 
tion; see the materials and methods). We ob- 
served a range of behaviors in both pulls and 
releases regarding initial speed, total distance 
traveled, and shape of time profiles (Fig. 2, A
and B, and fig. S8). Most traces showed a displacement
that was clearly distinguishable
from diffusion (Fig. 2, A and B, hatched areas; 
see the materials and methods). Collision with 
the nuclear periphery (Fig. 2, A and B, open 
symbols, and fig. S8, open symbols) was seen 
in nine of 35 traces, so the total displacement 
during the pull is most often not limited by the 
nuclear periphery. The initial force applied 
onto the locus largely predicted the variability 
seen in the initial motion (Fig. 2C and fig. 
S9A). The recoil motion after force release 
was in part predicted by the total distance over 
which the locus had been displaced during the 
pull (Fig. 2D and fig. S9B), with a simple linear 
relationship highlighting the elastic nature of 
chromatin. Deviations from these simple pro- 
portionality relationships indicate that the 
specific nuclear context or the state of the 
genomic locus might influence its response. 
In particular, we observed that when the locus 
moved more slowly than expected, it was less 
DNA dense, and when the locus moved faster 
than expected, it was more DNA dense (fig. 
S4C), suggesting that the compaction state 
of the locus itself affected its response to the 
force. The absolute nuclear position of the lo- 
cus did not correlate with its response to the 
force, but if the locus reached the periphery 
during the pull, it often recoiled more slowly 
than expected (fig. S10). Despite the varia- 
bility between traces, log-log plots of all the 
pulls and releases from the 30’-PR and 100”-PR 
trajectories, together with three additional 
high-frame-rate (dt = 0.5”) pull-release trajec- 
tories, revealed linear portions in the curves 
with a slope of 0.5 over more than three orders 
of magnitude in time (Fig. 2E). This behavior 
suggests that the different levels of the hier- 
archical genome organization are not charac- 
terized by vastly distinct mechanical properties. 
In addition, displacements that scale with 
time as t^0.5 can be empirically described by a
“fractional speed,” i.e., a single value in m/s^0.5
capturing how the motion evolves over time 
(Fig. 2, C and D, and fig. S9, right axes). The 
first pull of the 100”-PR trace, represented in 
this unit, indeed followed the same relation- 
ship as the 30’-PR traces (Fig. 2C, dark green 
triangle), and the slope of the resulting force- 
displacement plot yielded a unique factor of 
0.158 (±0.014) µm/s^0.5/pN, characterizing the
dynamic response of chromatin to force. These 
results indicate that a large part of the re- 
sponse of chromatin to force can be described 
by simple laws. 
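
To illustrate how a slope of 0.5 on a log-log plot corresponds to a displacement growing as the square root of time, the following sketch fits the log-log slope of a synthetic trajectory. The trajectory is noise-free and generated from the fractional speed and median force quoted in the text; it is not the study's data, and the function name `loglog_slope` is an assumption made for this example.

```python
import math


def loglog_slope(times, displacements):
    """Least-squares slope of log(displacement) versus log(time)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(d) for d in displacements]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic, noise-free pull: displacement = fractional speed * force * t^0.5.
FRACTIONAL_SPEED = 0.158  # um / s^0.5 / pN, value quoted in the text
FORCE_PN = 0.45           # median force quoted in the text, in pN

# Time points from 0.5 s to 1024 s, spanning more than three decades.
times = [0.5 * 2 ** k for k in range(12)]
displacements = [FRACTIONAL_SPEED * FORCE_PN * math.sqrt(t) for t in times]

if __name__ == "__main__":
    print(f"fitted exponent: {loglog_slope(times, displacements):.3f}")
```

On real trajectories the fit would be restricted to the linear portion of each curve, and noise would make the fitted exponent only approximately 0.5.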


The chromatin force response is well described 
by a free polymer model (Rouse chain) 


We then sought a model of chromatin that 
best explains our quantitative measurement 
of force-induced locus displacement. Several 
features in our data suggested a classical poly- 
mer model known as a Rouse polymer (34) as 


a first approximation to describe the response 
of chromatin to forces. The Rouse model rep- 
resents a polymer in which each monomer 
diffuses by thermal motion in a viscous me- 
dium and is connected to its two neighbors 
by elastic bonds. Rouse ignores steric effects 
(contact, hindrance), cross-links (affinity, stick- 
iness), and topological effects (fibers can pass 
through each other). This model is frequently 
invoked for chromatin dynamics because it 
predicts the characteristic power-law scaling
(that is, a linear relationship on a log-log plot) of
the mean squared displacement (MSD) versus
time with exponent 0.5, as observed here (fig. 
S11, A and E) and for other genomic loci in 
eukaryotes (35-37). We extended Rouse theory 
to study how a polymer responds to a point 
force (see the materials and methods and the 
supplementary text). Our calculations pre- 
dict a power-law behavior with exponent 0.5 
for displacements and recoils in response to 
force, consistent with our experimental obser- 
vations (Fig. 2E). These two power laws have 
the same physical origin, so the diffusion coefficient
obtained independently from the
MSD (1627 ± 19 nm² s^-0.5; fig. S11A) directly
relates to, and predicts, the slope of the
force-displacement plot (Fig. 2C, red line):
1627 nm² s^-0.5 / (2 k_B T) = 0.190 ± 0.003 µm/s^0.5/pN
(see the materials and methods). This agree- 
ment between two independent passive and 
active measurements (Fig. 2C and fig. S9A, red 
and gray lines, representing diffusion and force 
response, respectively) supports the Rouse 
model to explain our chromatin dynamics 
data. Inspected on a cell-by-cell basis, the 
force-free MSD of the locus before and after 
the pull-release experiments appeared very 
moderately reduced in most cases (fig. S11B). 
Its natural variability between cells does not 
appear to explain the variability of the re- 
sponse to force (fig. S11C). After force release, 
the Rouse model also predicted a recoil pro- 
portional to the total displacement during 
the pull. However, in many cases, the locus re- 
coiled somewhat more slowly than predicted 
by the Rouse theory. Instead, the theoretical 
prediction appears to define an upper bound 
for the recoils (Fig. 2D and fig. S9B, red lines), 
and deviations from Rouse theory were more 
pronounced at the nuclear periphery (fig. S10B).
This analysis suggests that the dynamic re- 
sponse of the chromosome to the force can be 
described by the Rouse polymer model, with 
additional effects from the nuclear environment. 
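
The link between the passive MSD and the active force response can be checked numerically. The following is a minimal sketch, not the authors' analysis code: it fits the exponent of a synthetic MSD curve on a log-log scale and converts the MSD coefficient into the force-displacement slope quoted above. The temperature (37 °C), the synthetic time points, and all function and variable names are assumptions made for illustration.

```python
import math


def fit_power_law(times, msd):
    """Least-squares line through (log t, log MSD); returns (exponent, prefactor)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(m) for m in msd]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx
    prefactor = math.exp(mean_y - exponent * mean_x)
    return exponent, prefactor


K_B = 1.380649e-23             # Boltzmann constant, J/K
TEMPERATURE_K = 310.15         # assumption: live cells imaged at 37 °C
KBT_PN_NM = K_B * TEMPERATURE_K * 1e21  # 1 J = 1e21 pN·nm

GAMMA_NM2 = 1627.0             # MSD coefficient from the text, nm² s^-0.5

# Synthetic, noise-free MSD curve following the Rouse scaling MSD = Gamma * t^0.5.
times = [0.5 * 2 ** k for k in range(10)]        # hypothetical lag times, s
msd = [GAMMA_NM2 * t ** 0.5 for t in times]      # nm²

exponent, prefactor = fit_power_law(times, msd)

# Slope of the force-displacement plot: Gamma / (2 k_B T), converted from nm to µm.
slope_um = prefactor / (2.0 * KBT_PN_NM) / 1000.0

print(f"fitted exponent: {exponent:.3f}")
print(f"predicted slope: {slope_um:.3f} µm/s^0.5/pN")
```

With real trajectories the fit would be applied to the measured MSD rather than to a synthetic curve, and the uncertainty on the coefficient would propagate to the slope.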


Model-based trajectory analysis reveals 
moderate hindrance by surrounding chromatin 


To further understand the physical nature of 
chromatin, we investigated how alternative 
polymer models are able to capture the 100"-PR
trace (Fig. 3, A to D). Our approach was to use 
the displacement trajectory and infer, assum- 
ing a given polymer model, the time profile of 
Fig. 2. Quantitative analysis of locus movement in response to force.

(A) Trajectories of the genomic locus in the direction of the applied force for
35 different cells during force exertion with the 30'-PR scheme. A selection of
trajectories, representative of the breadth of observed behaviors, are
highlighted and color-coded by force (see fig. S8 for individual trajectories).
Hatched areas in (A) to (D) correspond to the null model of pure diffusion
[...] SD measurement (see the materials and methods). (B) Recoil
trajectories relative to the time and position at the start of the release are
shown for the same loci as in (A). Curve R10 is the last release of the 100"-PR
trajectory. (C) Displacements measured at Δt = 5 min of force exertion on all the traces
from (A) plotted against the magnitude of the force. Coordinates are interpolated
between the frames before/after Δt. The green line and triangle correspond to the
envelope of the 100"-PR trajectory. Reported forces are the average over Δt.
Displacements are also expressed in µm/s^0.5 (right axis), allowing us to place pull P1
from the 100"-PR trace, measured at Δt = 100 s (dark green triangle). The red line
indicates the expected relationship from Rouse theory, solely based on an MSD
measurement. (D) Recoil after Δt = 5 min of force release on all the traces from (B)
and the last release of the 100"-PR trajectory (R10, green triangle), plotted against the
total displacement during the pull. The red line indicates the expected relationship
[... see fig. S9 for versions of (C) and (D) at different Δt and in linear
scales]. Open symbols in (A) to (D) indicate when loci are within 1.5 µm of the nuclear
periphery [at the moment of measurement in (A) to (C) and at the moment of force
release in (D)]. (E) Displacement and recoil trajectories, aligned on the time and
position at the moment of force switching, are represented as log-log plots for
pull-release experiments imaged with different frame intervals: dt = 0.5 s (see the materials
and methods), dt = 5 s (from the 100"-PR trace; Fig. 1F), and dt = 2 min [all 30'-PR
traces; (A) and (B)]. For the latter, average trajectories (right plots) were calculated
over all displacements where the applied force remains <2 pN (28 of 35 traces)
and over all the recoils after a displacement of <5 µm (21 of 35 traces). Red dotted
lines indicate the power-law behavior, with exponent 0.5, predicted by Rouse theory.


the force that produced the measured trajec- 
tory (see the materials and methods, supple- 
mentary text, and fig. S12, A to C). Disagreement 
between inferred and actual force profiles 
indicates when models are incorrect or in- 
complete, allowing one to select and refine the 
best model(s). With this approach, we com- 
pared a series of models (fig. S12C). First, a 
simple Rouse model without any adjustable 
parameters (i.e., calibrated using the MSD 
versus time plot; fig. S11A) predicted well the 
first pull and all of the release periods (Fig. 
3B). However, the prediction leaves some of 
the applied force unexplained (Fig. 3B, gray 
area between curves), suggesting a missing 
component in the model that would addi- 
tionally slow down or hinder the progres- 
sion of the locus. This residual unexplained 
force did not scale with speed and thus could 
not be explained as an additional viscous drag 
on the locus (fig. S12D). Instead, it increased 
progressively across successive pulls, suggesting
an accumulation of hindrance as the locus
moved through the nucleus. To represent this, 
we added a capacity for the locus to interact 
with the surrounding chromatin, represented 
as extra Rouse chains that are either attached 
to or pushed by the locus along its path (Fig. 
3C and fig. S12C). These models better pre- 
dicted the force profile throughout the trajec- 
tory compared with a pure Rouse model. The 
only free parameter used for the inference shown 
in Fig. 3C is the frequency at which the locus 
interacts with other polymers, which we found 
to be very low (fig. S12C), indicating that the 
interaction with the surrounding chromatin 
was moderate. This is also consistent with the 
small but detectable reduction in mobility of the 
locus before and after pull-release experiments 
(fig. S11B) and the subtle redistribution of 
DNA densities around the pulled locus (fig. 
S4D). These modeling results suggest that, 
upon force application and release on our genomic
locus, chromatin is well described as a
Rouse polymer (i.e., a free polymer in a viscous
environment), with moderate interactions from 
the surrounding chromatin, indicating that 
hindrance, cross-links, and topological effects 
play a minor role. 
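
The idea of inferring a force profile from a trajectory can be illustrated with a deliberately simplified sketch. It is not the inference procedure used here (that is described in the materials and methods and supplementary text). It assumes that the mean response of an ideal Rouse chain to a constant force F grows as c·F·t^0.5, that responses to successive force steps add linearly, and that the force is piecewise constant between sample times. The function names, the 5-min pull and release durations, and the use of c = 0.190 µm/s^0.5/pN are illustrative assumptions.

```python
import math


def rouse_displacement(times, forces, c):
    """Forward model: displacement at times[1:] for a piecewise-constant force.

    forces[i] is applied on [times[i], times[i+1]); each change in force
    contributes a step response c * dF * sqrt(t - t_step).
    """
    steps = [forces[0]] + [forces[i] - forces[i - 1] for i in range(1, len(forces))]
    out = []
    for k in range(1, len(times)):
        x = sum(c * steps[i] * math.sqrt(times[k] - times[i]) for i in range(k))
        out.append(x)
    return out


def infer_forces(times, displacements, c):
    """Inverse model: recover the piecewise-constant force by forward substitution."""
    steps = []
    for k in range(1, len(times)):
        # Contribution of the force steps already inferred (all but the latest).
        known = sum(c * steps[i] * math.sqrt(times[k] - times[i]) for i in range(k - 1))
        latest = (displacements[k - 1] - known) / (c * math.sqrt(times[k] - times[k - 1]))
        steps.append(latest)
    forces, total = [], 0.0
    for s in steps:
        total += s
        forces.append(total)
    return forces


C_SLOPE = 0.190                      # µm/s^0.5/pN, slope quoted in the text
times = [0.0, 300.0, 600.0]          # hypothetical: 5 min pull, then 5 min release
applied = [1.0, 0.0]                 # pN during each interval

trajectory = rouse_displacement(times, applied, C_SLOPE)
recovered = infer_forces(times, trajectory, C_SLOPE)

# Fraction of the pull displacement recovered during the release.
recoil_fraction = (trajectory[0] - trajectory[1]) / trajectory[0]
print(recovered, round(recoil_fraction, 3))
```

In this idealized picture the recoil fraction depends only on the ratio of release to pull duration, not on the force, which is one way to see why Rouse theory predicts a recoil proportional to the total displacement. A mismatch between the recovered and the applied force is what would flag a missing ingredient in the model.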


Interphase chromatin does not behave as a gel 
in force-response experiments 


Interphase chromatin has been proposed to 
be a gel-like material (11, 12, 19). A gel is a 
highly cross-linked polymer, which means that 
unlike a linear polymer, in which monomers 
are linked to two neighbors, extra links be- 
tween nonadjacent monomers form an inter- 
connected mesh, giving the gel solid-like 
properties. For chromatin, this could in prin- 
ciple arise from affinity between nucleosomes, 
as well as loops or bridges formed by proteins, 
complexes, and condensates and topological 
entanglement between chromatin fibers. How- 
ever, in such an interconnected mesh structure, 
short paths effectively linking the pulled locus 
Fig. 3. Model-based analysis and hypothesis testing. (A) Trajectory of the
locus shown in the direction of the force (green curve) and orthogonal to the
force in the imaging plane (blue curve) for the 100"-PR experiment. Arrows
indicate apparent obstacle [see asterisks and hash marks [...]].
(B to D) Evaluation of different models through their capacity to reproduce
the experimentally measured force time profile (orange curve) by inferring it
from the trajectory (gray curve). Models shown here are a simple Rouse
polymer (34) (B), the same model with extra polymer chains being pushed by the
locus to represent the surrounding chromatin (C), and a gel [...]
represented as a Rouse polymer in a viscoelastic environment (D). (See full list in
fig. S12.) (E) Residual unexplained force from the second model [area between
curves in (C)] plotted along the trajectory of the locus, highlighting an obstacle


to all other loci in the nucleus would result in 
long-range deformation of the spatial pattern 
of DNA density, which we did not observe 
(Fig. 1, D and F, and movies S3 and S4). Fur- 
ther, if the chromatin surrounding the locus 
were gel-like, then it would effectively act as 
a viscoelastic medium. This assumption does 
not recapitulate well the experimental data 
(even with two free parameters; Fig. 3D and 
fig. S12C) and is inconsistent with the ob- 
served scaling of 0.5 in the MSD (fig. S11, A 
and E), which argues for a simply viscous and 
nonelastic medium. Finally, if the locus were 
part of an interconnected mesh, then short 
series of links would tether it to large struc- 
tures (e.g., the periphery and nucleoli). A 
Rouse model that includes a finite tether does 
not recapitulate the experimental data (fig. 
S12C) and is inconsistent with the linear behavior
observed in Fig. 2E up to several micrometers.
These results suggest again minor
effects of cross-links and topological constraints
and argue against the view that interphase 
chromatin behaves like a gel at the spatial and 
temporal scale of our observations. 


Heterogeneities in the trajectory reveal obstacles 
in the nuclear interior and a soft elastic material at 
the nuclear periphery 


Even the models that best capture the data 
leave part of the force unexplained (Fig. 3C, 
gray area). We plotted this residual unex- 
plained force as a function of spatial position 
(Fig. 3E). This revealed an accumulation of 
non-null residual forces at specific locations, 
matching visible features in the trajectory and 
in the spatial distribution of DNA density in 
the nucleus. First, the residual force in pulls P3 
to P5 corresponds to an apparent obstacle in 
the trajectory (Fig. 3, A and C, asterisks) oc- 
curring at a high-to-low transition of DNA den- 
sity (Figs. 3F and 1F). It appears as a spatially 


(asterisk) and an elastic region near the nuclear periphery (hash mark) for which
physical parameters are measured (see the materials and methods) and which
[...] (F). (F) Time projection images before the first pull and
during pulls P4 and P8 showing how the spatial distribution of DNA density in
the nucleus relates to the identified obstacles. SiR-DNA images were band-pass
filtered (see the materials and methods). (G) Hypotheses on how the lateral
mobility of the locus may change depending on its force-induced displacement.
(H and I) MSD of the lateral movement of the locus calculated as a function
[...] velocity in the direction of the force. Solid lines on both
[...] the MSD-velocity (I) representations correspond to a
single-parameter fit describing how lateral mobility increases with velocity in the
[...]

defined barrier of residual force (Fig. 3E), requiring
an energy of ~46 kT to overcome.
This suggests that, whereas DNA dense regions 
are not obstacles per se, the interface between 
high- and low-density regions may constitute 
a barrier. The energy that we estimated sug- 
gests that such barriers may be overcome by 
ATP-dependent molecular motors (32, 33) but 
not likely by spontaneous thermal fluctuations. 
Second, the residual force in pulls P8 to P10 
(Fig. 3, A and C, hash marks) corresponds to 
the collision with structures near the nuclear 
periphery (Figs. 1F and 3F, hash marks). The 
observed linear force-distance relationship 
(Fig. 3E) indicates a solid-like elastic behav- 
ior for these structures over at least 600 nm 
and with a spring constant of 4.81 pN/um. This 
is much softer than what was measured by 
whole-nucleus stretching experiments (14, 15), 
which could be explained by the small size of 
the locus and/or the existence of a soft layer of 
elastic peripheral components (e.g., heterochromatin
and nuclear lamina) rather than
the material directly contributing to the struc- 
tural rigidity of the nucleus. 
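
The two physical parameters reported in this section can be put in perspective with a short calculation. This is a rough sketch under stated assumptions: the Boltzmann factor exp(-E/kT) is used only as an order-of-magnitude indicator of how unlikely a purely thermal crossing of the barrier is, the periphery is treated as an ideal Hookean spring, and the temperature (37 °C) is assumed. Variable names are illustrative.

```python
import math

K_B = 1.380649e-23                 # Boltzmann constant, J/K
TEMPERATURE_K = 310.15             # assumption: 37 °C
KBT_PN_NM = K_B * TEMPERATURE_K * 1e21   # thermal energy in pN·nm

# Barrier at the high-to-low DNA density interface (value from the text).
BARRIER_KT = 46.0
boltzmann_factor = math.exp(-BARRIER_KT)      # relative weight of a thermal crossing
barrier_pn_nm = BARRIER_KT * KBT_PN_NM        # same barrier expressed as work

# Elastic structure near the nuclear periphery (values from the text).
SPRING_PN_PER_UM = 4.81
EXTENSION_UM = 0.6                            # "at least 600 nm"
restoring_force_pn = SPRING_PN_PER_UM * EXTENSION_UM
elastic_energy_kt = 0.5 * SPRING_PN_PER_UM * EXTENSION_UM ** 2 * 1000.0 / KBT_PN_NM

print(f"Boltzmann factor for 46 kT: {boltzmann_factor:.1e}")
print(f"barrier as work: {barrier_pn_nm:.0f} pN·nm")
print(f"restoring force at 600 nm: {restoring_force_pn:.2f} pN")
print(f"stored elastic energy: {elastic_energy_kt:.0f} kT")
```

Under these assumptions the restoring force at 600 nm is a few piconewtons, the same order as the forces applied in these experiments, while the Boltzmann factor for the barrier is vanishingly small.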


Lateral mobility of the locus reflects transient 
collisions with obstacles in the nucleoplasm 


To further investigate the material encoun- 
tered by the locus, we analyzed the lateral mo- 
tion of the locus as it was being pulled and 
released (Fig. 3A, blue curve). We hypothe- 
sized that collisions with obstacles could in- 
crease lateral mobility or, conversely, that the 
locus being dragged into a more constraining 
and entangled environment could result in a 
reduction of its mobility (Fig. 3G). After computing
the MSD of the lateral motion as a
function of both time delay t and velocity v∥
along the direction of the force (Fig. 3, H and
I, and fig. S11, F and G), we observed a clear
increase of lateral mobility when the locus
moved (for both forward and backward movements;
Fig. 3I), suggesting the existence of
obstacles that deflected the motion. This additional
mobility in the MSD is captured by a
term proportional to v∥, as would be expected
for collisions, and proportional to t (not t²),
as would be expected if the force due to the 
collision with obstacles persists in the same 
direction across several frames, indicating the 
existence of large obstacles. Indeed, in P3 for 
instance, the lateral motion clearly shows a 
directional behavior (Figs. 1E and 3A). However,
the relationships that we observed (Fig. 3,
H and I) held even when excluding all of the
time points before P4 (fig. S11, H and I), indicating
that the collision with obstacles was
widespread throughout the nucleus. These re- 
sults, together with our observation that very 
few chromatin fibers appeared to be carried 
along with the locus, indicate that obstacles 
are frequently encountered by the locus, but 
most interactions are weak and transient. 
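
A single-parameter description of this kind can be sketched as follows. The functional form is an assumption made for illustration: the lateral MSD is written as a resting term plus a collision term a·|v|·t, and the coefficient a is obtained by least squares through the origin. The resting MSD (here a t^0.5 law with the passive coefficient), the synthetic samples, and all names are hypothetical and do not reproduce the fit shown in Fig. 3, H and I.

```python
def fit_collision_coefficient(samples, baseline):
    """Least-squares estimate of a in MSD = baseline(t) + a * |v| * t.

    samples: iterable of (lag_s, velocity_um_per_s, lateral_msd_um2).
    baseline: function giving the lateral MSD of a locus at rest.
    """
    num = 0.0
    den = 0.0
    for lag, velocity, msd in samples:
        z = abs(velocity) * lag            # regressor: speed times time delay
        num += (msd - baseline(lag)) * z
        den += z * z
    if den == 0.0:
        raise ValueError("need at least one sample with nonzero velocity")
    return num / den


GAMMA_UM2 = 1627e-6                        # µm² s^-0.5; assumed resting coefficient


def rest_msd(lag):
    """Hypothetical resting lateral MSD following the Rouse t^0.5 scaling."""
    return GAMMA_UM2 * lag ** 0.5


TRUE_A = 0.05                              # µm; arbitrary value used to build test data
samples = [
    (lag, v, rest_msd(lag) + TRUE_A * abs(v) * lag)
    for lag in (5.0, 10.0, 20.0, 40.0)
    for v in (-0.02, -0.01, 0.0, 0.01, 0.02)   # forward and backward movement
]

fitted_a = fit_collision_coefficient(samples, rest_msd)
print(f"fitted collision coefficient: {fitted_a:.3f} µm")
```

Because the regressor uses |v|, forward and backward movements contribute in the same way, mirroring the observation that lateral mobility increases in both directions.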


Discussion 


Our measurements of how a genomic locus 
inside the nucleus of a living cell responds to a 
point force indicate that interphase chromatin
has fluid-like properties and behaves as a
free polymer. This contrasts with previous 
studies depicting chromatin as a stiff, cross- 
linked polymer gel with solid-like properties 
(11, 12, 19). Our observation that near-piconewton 
forces can easily move a genomic locus across 
the nucleus over a few minutes (Fig. 1, D and 
F) also contrasts with a previous study report- 
ing confined submicrometer displacements 
over seconds upon application of 65 to 110 pN 
forces to a 1-um bead (19). We propose that our 
results may be reconciled with previous ex- 
periments in several ways. First, unlike a 
micrometer-sized bead, the locus used in 
our experiments is small and may be deform- 
able enough to pass through the surrounding 
chromatin. Second, chromatin may contain
many small, gel-like patches embedded in a 
structure with liquid, Rouse-like properties 
at a larger scale. This is also consistent with 
our observation that the transiting locus fre- 
quently encounters obstacles. Third, chroma- 
tin may be a weak gel, i.e., a gel with short-lived 
cross-links (11). Such a gel could continuously
maintain a stiff, globally connected network 
that resists stresses over large length scales 
while permitting fluid-like motions at smaller 
scales. Future experiments perturbing chro- 
matin state and chromatin associated proteins 
will be important to reconcile observed micro- 
and mesoscale mechanics. 

Organization of chromosomes that allows 
movement of genomic loci across large dis- 
tances by weak forces could have implications 
for a range of genome functions. For example, 
large-scale movements of chromosomes occur 
during nuclear inversion in rod cell differ- 
entiation for nocturnal mammals (38). Specific 
genes undergo long-range directional motion 
upon transcriptional activation (39, 40). Long 
and highly transcribed genes can form ~5-µm
giant loops, believed to be due to chromosome
fiber stiffening (41). Certain double-strand break
sites undergo large-scale, nuclear F-actin de- 
pendent relocation to the nuclear periphery 
(42). These DNA-based biological processes 
require a nuclear organization in which such 
movements are possible. Our results reveal the 
mechanical properties of chromatin in which 
such large-scale movements would only re- 
quire weak (near piconewton) forces. Although 
sustained unidirectional forces are unlikely to 
occur naturally in the nucleus, the magnitude 
of the forces and the time scale of force ex- 
ertion in our experiments are comparable to 
those of molecular motors such as SMC com- 
plexes and Pol II; that is, in the sub-piconewton 
(32) or low-piconewton (33) ranges and ap- 
plied over minutes (e.g., 10 min for Pol II to 
elongate through a 25-kb gene, 5-30 min for 
SMC complexes). Therefore, some molecular 
motors in the nucleus operate in a force range 
that is sufficient to substantially reorganize 
the genome in space. 
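
As a rough illustration of this argument, the Rouse scaling measured above can be extrapolated to motor-like forces and durations. This is a back-of-the-envelope sketch, not a prediction made in the text: it assumes a constant unidirectional force (which, as noted, is unlikely to occur naturally), an unobstructed locus, and the slope of 0.190 µm/s^0.5/pN. The function name and the chosen force and time grid are illustrative.

```python
import math

SLOPE_UM = 0.190   # µm/s^0.5/pN, force-displacement slope quoted in the text


def rouse_pull_displacement(force_pn, duration_s):
    """Displacement of an ideal Rouse locus under a constant force (sketch)."""
    return SLOPE_UM * force_pn * math.sqrt(duration_s)


# Elongation rate implied by "10 min for Pol II to elongate through a 25-kb gene".
pol2_rate_bp_per_s = 25_000 / (10 * 60)

for force_pn in (0.5, 1.0, 2.0):          # sub- to low-piconewton range
    for minutes in (5, 10, 30):           # time scales quoted for Pol II and SMC
        d = rouse_pull_displacement(force_pn, minutes * 60)
        print(f"{force_pn:.1f} pN for {minutes:2d} min -> {d:.1f} µm")

print(f"implied Pol II rate: {pol2_rate_bp_per_s:.1f} bp/s")
```

The resulting distances are upper bounds for an idealized, sustained pull; intermittent or changing force directions would give smaller net displacements.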

Future work will be important to expand 
and complement our results. Although the ge- 
nomic array that we used here is known to be 
chromatinized and has been used extensively 
to recapitulate and study functional chromatin- 
based processes (27-29), we cannot exclude 
that its repetitive and artificial nature might 
prevent some of our measurements from being 
applicable to nonrepetitive and native regions. 
Manipulating loci other than a subtelomeric 
locus on the longest chromosome (chromo- 
some 1) in other genomic contexts (e.g., hetero- 
chromatin or euchromatin) and in different 
cell types will be important to assess the gen- 
eralizability of our findings in various biolog- 
ical contexts. 


Our approach to mechanically manipulate 
and relocate genomic loci in the nuclear space 
opens many avenues for future research, from 
the study of interphase chromosome mechan- 
ics to the perturbation of genome functions, 
including transcription, replication, DNA dam- 
age repair, and chromosome segregation. By 
giving access to physical parameters and re- 
vealing fundamental scaling laws to describe 
chromatin mechanics, our work provides a 
foundation for future theories of genome 
organization. 
SOLAR CELLS


Ion-modulated radical doping of spiro-OMeTAD for
more efficient and stable perovskite solar cells 


Tiankai Zhang, Feng Wang, Hak-Beom Kim, In-Woo Choi, Chuanfei Wang, Eunkyung Cho,
Rafal Konefal, Yuttapoom Puttisong, Kosuke Terado, Libor Kobera, Mengyun Chen, Mei Yang,
Sai Bai, Bowen Yang, Jiajia Suo, Shih-Chi Yang, Xianjie Liu, Fan Fu, Hiroyuki Yoshida,
Weimin M. Chen, Jiri Brus, Veaceslav Coropceanu, Anders Hagfeldt, Jean-Luc Brédas,
Mats Fahlman, Dong Suk Kim, Zhangjun Hu, Feng Gao


Record power conversion efficiencies (PCEs) of perovskite solar cells (PSCs) have been obtained with the 
organic hole transporter 2,2',7,7'-tetrakis(N,N-di-p-methoxyphenyl-amine)9,9'-spirobifluorene (spiro-OMeTAD). 
Conventional doping of spiro-OMeTAD with hygroscopic lithium salts and volatile 4-tert-butylpyridine is a 
time-consuming process and also leads to poor device stability. We developed a new doping strategy for 
spiro-OMeTAD that avoids post-oxidation by using stable organic radicals as the dopant and ionic salts as 
the doping modulator (referred to as ion-modulated radical doping). We achieved PCEs of >25% and 
much-improved device stability under harsh conditions. The radicals provide hole polarons that instantly 
increase the conductivity and work function (WF), and ionic salts further modulate the WF by affecting the 
energetics of the hole polarons. This organic semiconductor doping strategy, which decouples conductivity and 
WF tunability, could inspire further optimization in other optoelectronic devices. 


Metal halide perovskites have achieved
impressive power conversion efficiencies
(PCEs) in both single-junction (1-6)
and tandem solar cells (7, 8). However,
a key challenge limiting their practical
application is the trade-off between high effi- 
ciency and high stability, an issue determined 
by not only the perovskite materials (9) but also 
the charge transport layers (10). Currently, most 
high-performance perovskite solar cells (PSCs) 
with >24% PCE are based on the benchmark 
hole transport layer (HTL) spiro-OMeTAD doped 
by lithium bis(trifluoromethane)sulfonimide 
(LiTFSI) and 4-tert-butylpyridine (tBP) (11-13),
a process that limits the stability of these high- 
efficiency PSCs. 

Conventional spiro-OMeTAD doping (Fig. 1A, 
top) involves LiTFSI to facilitate the generation 
of spiro-OMeTAD•⁺TFSI⁻ radicals and tBP to
improve dopants’ solubility and film morphol- 
ogy (14). This recipe usually requires an in situ 
oxidization process for a period of 10 to 24 hours
in air to reach the optimal conductivity and 
work function (WF) (15). However, because 
of the low doping efficiency of this process, 
a large amount of dopants and additives 
(~56 mol % LiTFSI and ~330 mol % tBP) are 
required to generate ~10 mol % radicals (16).
The residual LiTFSI, tBP, and by-products (e.g.,
LixOy) are not only diffusible (17) but also sensitive
to humidity and heat (18, 19), negatively
affecting device stability. 

In addition, the complex in situ oxidation 
process makes it challenging to understand the 
mechanism of conventional spiro-OMeTAD 
doping, which limits further development of 
stable HTLs with high PCEs. Several efforts 
addressing the spiro-OMeTAD stability issue 
have focused on the hygroscopic lithium salt. 
For example, a CO2 gas-forming treatment was
recently used to remove the hygroscopic lithium 
species (20). Less hygroscopic dopants, including 
metallic salts (21-23), protic ionic liquids (24),
and ex situ synthesized spiro-OMeTAD²•⁺(TFSI⁻)₂
radicals (25, 26), have also been used to replace 
the oxidant LiTFSI. Such doping usually re- 
quires the addition of volatile tBP, whose role 
in the doping process has been controversial 
(27, 28). In particular, tBP would react with 
radicals and coordinate with LiTFSI to form 
by-products (Fig. 1A, top), which not only neg- 
atively affects device performance but also 
prevents a full understanding of the spiro- 
OMeTAD doping mechanism. 

We developed a stable doping strategy for 
instant and effective doping of spiro-OMeTAD, 
which we call ion-modulated (IM) radical 
doping (Fig. 1A, bottom), that relies on two 
components. One is the presynthesized stable 
spiro-OMeTAD²•⁺(TFSI⁻)₂ radical (synthesis
and characterizations in figs. S1 and S2), which

Fig. 1. Comparison of perovskite solar cells (PSCs) based on the conventional and
ion-modulated (IM) radical doping strategies. (A) Illustration of the complex
in situ reaction processes in the conventional doping process (top) and the clean, instant
IM radical doping strategy (bottom) of spiro-OMeTAD. The radical and ionic
salt were dissolved in 1,1,2,2-tetrachloroethane. (B) Current density-voltage (J-V) curves
of PSCs (SnO2 electron transport layer) based on conventional doping, radical
doping, and IM radical doping of spiro-OMeTAD. (C) J-V curves of PSCs (mesoporous
TiO2 electron transport layer) based on conventional and IM radical doping of
spiro-OMeTAD. (D and E) PCE tracking of unencapsulated PSCs based on conventional
and IM radical doping of spiro-OMeTAD under 70 ± 5% humidity (D) and 70 ±
3°C thermal aging (E). Error bars denote SD.


acts as the main dopant to improve conductivity 
and WF. The second is the ionic salt 4-tert-butyl-1- 
methylpyridinium bis(trifluoromethylsulfonyl)imide 
(TBMP⁺TFSI⁻) to further modulate the WF. In
this doping strategy, the localized ionic envi- 
ronment created by the ionic salt was used to 
manipulate the energetics of the hole polarons 
and decouple the conductivity and WF tuna- 
bility. The doped spiro-OMeTAD based on our 
IM radical doping strategy delivered PSCs 
simultaneously with high efficiency (PCE > 25%) 
and high stability (T80 for ~1200 hours under 70 ±
5% relative humidity (RH) and T80 for ~800 hours
under 70 ± 3°C without encapsulation), minimizing
the trade-off between efficiency and
stability of PSCs. In addition, the IM radical 
doping strategy provides a facile yet effective 
approach to separately optimize the conduc- 
tivity and WF of organic semiconductors for a 
variety of optoelectronic applications. 
PSC performance based on
IM radical-doped spiro-OMeTAD 
We first compared the distinctive effects of 
conventional and IM radical doping of spiro- 
OMeTAD using (FAPbI3)0.99(MAPbBr3)0.01 (FA,
formamidinium; MA, methylammonium) PSCs
with an SnO2 electron transport layer (ETL)
(fig. S3) (12). The conventional spiro-OMeTAD 
doping process requires an oxidation time 
of ~24 hours to reach the optimized PCE of 
22.2% (Fig. 1B, fig. S4A, and table S1). The grad- 
ual improvement in device performance with 
increasing oxidation time is consistent with 
the conductivity increase (fig. S5A) that re- 
sulted from the continuous generation of spiro- 
OMeTAD•⁺TFSI⁻ radicals, as evidenced by the
ultraviolet-visible-near-infrared (UV-Vis-NIR) 
absorption spectra (fig. S5B) (29, 30). 

In our IM spiro-OMeTAD doping strategy, 
the first additive consists of presynthesized 


spiro-OMeTAD²•⁺(TFSI⁻)₂ (31, 32), which can be
immediately converted into spiro-OMeTAD•⁺TFSI⁻
monoradicals through comproportionation
with the neutral spiro-OMeTAD (25):


spiro-OMeTAD + spiro-OMeTAD²•⁺(TFSI⁻)₂
→ 2 spiro-OMeTAD•⁺TFSI⁻ (1)


We note that more radical cations (and hence 
a higher conductivity) are obtained at lower 
dopant levels using the current approach than 
using LiTFSI, as indicated by absorption spec- 
tra and conductivity measurements (fig. S5). 
Upon the incorporation of radicals, the device 
performance improved relative to undoped 
spiro-OMeTAD (fig. S4B), and an optimal per- 
formance was reached with the doping radical 
ratio of ~14 mol % (calculated to the ratio of 
monoradicals). However, compared to a high 
Fig. 2. Effects of radicals and ionic salts on conductivity and energetics.

(A and B) J-V curves of the hole-only devices (A) and conductivity of the spiro-OMeTAD 
films doped with different radical amounts (B). Inset of (B) shows the structure of the 
hole-only devices. (C) J-V curves of the hole-only devices with 14 mol % radicals and different 


open-circuit voltage (V_OC) of 1.15 V achieved
with the conventional doping recipe, the optimized
V_OC of spiro-OMeTAD doped with
radicals was 1.04 V (Fig. 1B).
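
The stoichiometry of the comproportionation in Eq. 1 fixes how much presynthesized diradical salt corresponds to a given monoradical ratio. The sketch below is a minimal bookkeeping example under two assumptions that are not stated in the text: the reaction goes to completion, and the mol % is expressed relative to all spiro-OMeTAD units in the film. Function names are illustrative.

```python
def monoradical_fraction(neutral_mol, diradical_mol):
    """Fraction of spiro-OMeTAD units present as monoradicals after Eq. 1.

    Each diradical dication consumes one neutral molecule and yields
    two monoradical cations (assumes complete conversion).
    """
    if diradical_mol > neutral_mol:
        raise ValueError("not enough neutral spiro-OMeTAD to convert all diradicals")
    monoradicals = 2.0 * diradical_mol
    remaining_neutral = neutral_mol - diradical_mol
    return monoradicals / (monoradicals + remaining_neutral)


def diradical_needed(target_fraction, total_units=1.0):
    """Moles of diradical salt needed for a target monoradical fraction."""
    return target_fraction * total_units / 2.0


# The optimum quoted in the text is ~14 mol % monoradicals.
dose = diradical_needed(0.14)              # moles of diradical per mole of spiro units
neutral = 1.0 - dose                       # the rest is added as neutral spiro-OMeTAD
fraction = monoradical_fraction(neutral, dose)
print(f"diradical salt: {dose:.2f} mol per mol, monoradicals: {fraction:.0%}")
```

Under these assumptions, ~14 mol % monoradicals corresponds to about 7 mol % of the diradical salt, with no post-oxidation step involved.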

Upon addition of the second additive (ionic 
salt, TBMP⁺TFSI⁻), the V_OC of the solar cells
increased and a V_OC of 1.17 V was achieved with
20 mol % TBMP⁺TFSI⁻. The resulting PCE of
23.4% was even higher than that based on the
conventional doping (Fig. 1B). Further increasing
the TBMP⁺TFSI⁻ content led to a decrease
in the fill factor (FF) (fig. S4C and table S2) 
possibly because of ion aggregation (fig. S6). 

The high V_OC of the optimized devices was
also consistent with the values of electroluminescence
external quantum efficiency
(EL_EQE), which increased from 0.05% to 5.4%
upon the addition of 20 mol % TBMP⁺TFSI⁻
into the HTL (fig. S7), indicating reduced nonradiative
recombination. Remarkably, based
on the meso-TiO2 scaffold with FAPbI3 as the
active layer, the IM radical doping of spiro- 
OMeTAD delivered a PCE of 25.1% (table S3, 
certified 25.0% in fig. S8), which was again 
higher than that of the control device with 
conventionally doped spiro-OMeTAD (Fig. 1C). 
The PCE statistics demonstrated that the IM 
radical doping strategy, which did not involve
a post-oxidation process, was more control- 
lable than the conventional doping process 
(fig. S9). 

Because the IM radical doping strategy com- 
pletely removed the need to use hygroscopic 
LiTFSI and volatile tBP, an improvement in 
device stability can be expected. We system- 
atically compared the device stability with IM 
radical doping versus conventional doping as 
a function of RH, heat, and illumination. 
Relative to the conventional doping of spiro- 
OMeTAD, the IM radical doping strategy extended
the T80 of PSCs (unencapsulated) from
~96 to ~1240 hours under high RH of ~70 ±
5% (Fig. 1D and fig. S10), and T80 from ~264 to
~796 hours under ~70 ± 3°C (Fig. 1E and
fig. S11). The improved device stability with 
the IM radical-doped HTL was ascribed to not 
only the improved stability of HTL but also the 
prevention of phase and thermal degradation 
of the perovskite active layer under high RH 
and under heat stress, as evidenced from op- 
tical images (fig. S12) and x-ray diffraction re- 
sults (figs. S13 and S14) of the degraded devices. 

Scanning electron microscopy (SEM) images 
provided more details of the protection effect 
amounts of TBMP⁺TFSI⁻. The inset shows conductivity. (D and E) Fermi-level and HOMO
onsets of the spiro-OMeTAD films doped with different radical amounts (D) and different
TBMP⁺TFSI⁻ amounts (14 mol % radicals) (E). Error bars denote SD. (F) Illustration
of the band alignment between the perovskite layer and the HTL with different WF values.


from the spiro-OMeTAD HTL based on IM 
radical doping. A large number of pinholes 
were observed in degraded devices based on 
the conventionally doped HTLs, implying that 
LiTFSI residuals absorbed moisture and that 
tBP volatilized gradually. Both effects acce- 
lerated the degradation of the perovskite ac- 
tive layer (figs. S15 and S16). In contrast, the 
devices based on the IM radical doping strat- 
egy maintained a compact and uniform mor- 
phology after long-term exposure to high RH and 
thermal stress (figs. S15 and S17), protecting the 
perovskite layer underneath. In addition, under 
maximum power point (MPP) tracking, the 
PCEs of PSCs based on conventional doping 
decayed to ~85% after 500 hours of continuous 
illumination. Devices based on the IM radical 
doping strategy maintained ~95% of the initial 
PCE after the same operational time (fig. S18), 
similar to that of devices using inorganic HTLs 
(33, 34) or undoped polymer HTLs (35). 


The roles of radicals and ionic salts on 
spiro-OMeTAD doping 

Our IM radical doping strategy provided an 
instantly effective and stable doping of spiro- 
OMeTAD. In addition, this simple process 
allowed us to decouple and investigate the
fundamental functions of additives (radicals 
and ionic salts). We focused on two physical 
parameters that are key to the effectiveness of 
the HTL: conductivity and energy levels [in- 
cluding both WF and ionization energy (IE)]. 

To investigate the HTL conductivity, we used 
hole-only devices with the structure ITO/PEDOT: 
PSS/doped spiro-OMeTAD/Au [ITO, indium tin 
oxide; PEDOT, poly(3,4-ethylenedioxythiophene); 
PSS, polystyrene sulfonate]. The effects of or- 
ganic radicals on the film conductivity (Fig. 2, 
A and B) showed two distinct stages as a 
function of an increasing amount of organic 
radicals. There was initially an extremely high 
doping efficiency with a small doping ratio of 
radicals (<2 mol %; Fig. 2B, light blue area), 
which suggested efficient hole polaron gener- 
ation. In particular, 0.1 mol % of radicals could 
increase the conductivity of the film by almost 
four orders of magnitude, from 1.10 × 10⁻⁸ S
cm⁻¹ to 8.03 × 10⁻⁵ S cm⁻¹. This increase could
be attributed to the filling of deep trap states 
by dopant-generated charge carriers, which 
would bring increased carrier concentration 
as well as higher carrier mobility (36). 
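
The size of this jump can be checked with a single logarithm. The short sketch below is only an arithmetic illustration: the function name is ours, the undoped value is the one quoted above, and the doped value is taken as 8.03 × 10⁻⁵ S cm⁻¹, an exponent we assume because it is the one consistent with "almost four orders of magnitude".

```python
import math


def orders_of_magnitude(before: float, after: float) -> float:
    """Return log10(after / before), i.e. the change expressed in decades."""
    if before <= 0 or after <= 0:
        raise ValueError("conductivities must be positive")
    return math.log10(after / before)


sigma_undoped = 1.10e-8  # S/cm, pristine film (value quoted in the text)
sigma_doped = 8.03e-5    # S/cm, 0.1 mol % radicals (exponent assumed, see above)

decades = orders_of_magnitude(sigma_undoped, sigma_doped)
print(f"conductivity gain: {decades:.2f} decades")
```

Under these inputs the gain is just under four decades, which matches the wording of the text.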

For higher doping concentrations, the hole- 
only current density exhibited a linear depen- 
dence over the entire voltage range from 0 V 
to 5 V, indicative of a high hole concentration 
in the system. In the second stage (doping 
ratio from 2 mol % to 60 mol %, marked by the 
light red area), the doping efficiency nearly sat- 
urated and the conductivity peaked at ~1.00 x 
10°? S cm‘ at ~14 mol % radicals (Fig. 2B). This 
doping saturation phenomenon was consistent 
with previous reports in other organic semi- 
conductor systems and is ascribed to the 
increase in energetic disorder induced by 
long-range Coulomb interactions when in- 
creasing the number of dopants (37). The 
addition of the TBMP⁺TFSI⁻ ionic salt had
negligible effects on the film conductivity over 
a wide range of concentrations (up to 20 mol %) 
(Fig. 2C), beyond which a slight decrease in 
film conductivity was observed, possibly be- 
cause of the aforementioned phase aggregation 
and increased disorder in the film (see below). 

In addition to conductivity, the energy level 
alignment at the HTL interface is another 
parameter that is critically important to device 
performance. An optimal energy level of the 
HTL helps to facilitate hole extraction and 
eliminate interfacial nonradiative recombina-
tion, contributing to high device performance
(especially high V_OC). We investigated the im-
pact of the radicals on the relevant energy
levels by ultraviolet photoelectron spectroscopy 
(UPS). For the pristine spiro-OMeTAD, the ion- 
ization energy (IE) was 5.08 eV, which accord- 
ing to Koopmans’ theorem (38) corresponds 
to the energy of the highest occupied molecular 
orbital (HOMO), and the WF was 4.36 eV (fig. 
S19A). The density functional theory (DFT)
results showed that the IE of an isolated spiro-
OMeTAD molecule was ~5.46 eV, and this value
decreased to 5.03 eV when the calculations were 
performed by taking into account an effective 
medium with a dielectric constant of 2.4 (fig. S20). 

With increasing amounts of radicals, both IE 
and WF increased (Fig. 2D). A more detailed 
comparison revealed that the difference be- 
tween WF and IE decreases along with in- 
creasing dopant concentration (Fig. 2D), 
emphasizing the extent to which the Fermi 
level moves toward the HOMO level, which is 
a key signature of p-doping. Combining the 
conductivity results in Fig. 2B, we noticed 
that a radical concentration of >2 mol % could 
affect the energy levels of the film despite 
almost no improvement in conductivity. The 
shift of energetic levels saturated at ~14 mol % 
radical doping, which is due to the limit in 
charge transfer and hole generation (37). 

Based on the 14 mol % radical-doped system,
we further assessed the effect of the
TBMP⁺TFSI⁻ ionic salt on the film energy
levels. In contrast to the simultaneous change
of the HOMO level and WF upon the addition
of radicals, the ionic salt increased the WF to
5.08 eV with 20 mol % TBMP⁺TFSI⁻ but had a
negligible effect on the HOMO levels (Fig. 2E
and fig. S19B). The slightly decreased WF with
30% TBMP⁺TFSI⁻ was ascribed to the effects
of ion aggregation (fig. S6). As such, the
addition of TBMP⁺TFSI⁻ breaks the doping limit
of radical doping, resulting in a Fermi level that
is very close to the HOMO level (the difference
is as small as 0.12 eV, a value much smaller
than those obtained in traditional organic
semiconductor doping). The improved doping
efficiency after TBMP⁺TFSI⁻ incorporation was
also confirmed by the increased carrier
concentration, as indicated from a Mott-Schottky
analysis (fig. S21).
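
The relation between these quantities is simple bookkeeping: with both energies measured from the vacuum level, the gap between the Fermi level and the HOMO is the ionization energy minus the work function. The sketch below applies that to the numbers quoted in the text. The function name is ours, and the ionization energy of the salt-modulated film is not quoted directly; it is inferred here from the reported WF (5.08 eV) and offset (0.12 eV), so treat it as an implied value.

```python
def fermi_to_homo_offset(ionization_energy_ev: float, work_function_ev: float) -> float:
    """Energy gap (eV) between the Fermi level and the HOMO: IE - WF."""
    return round(ionization_energy_ev - work_function_ev, 2)


# Pristine spiro-OMeTAD: IE = 5.08 eV, WF = 4.36 eV (values quoted in the text).
pristine_offset = fermi_to_homo_offset(5.08, 4.36)

# Radical-doped film with 20 mol % ionic salt: WF = 5.08 eV, offset = 0.12 eV (quoted).
# The ionization energy below is implied by those two numbers, not quoted.
implied_ie_with_salt = round(5.08 + 0.12, 2)

print(f"pristine offset: {pristine_offset} eV")
print(f"implied IE with salt: {implied_ie_with_salt} eV")
```

The offset shrinks from 0.72 eV in the pristine film to the reported 0.12 eV, which is the p-doping signature the paragraph describes.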

In PSCs, as long as the HOMO energy of 
the HTL remains above the valence band of 
perovskites, it has little impact on the device 
V_OC value. For example, different HTLs with
HOMO energies between -5.2 and -5.4 eV 
have almost the same J-V curves (39). Within 
a suitable HOMO energy range, an optimal 
WF of the HTL aligned with the quasi-Fermi 
level (QFL) of the perovskite active layer is 
critical to minimizing the voltage loss (40, 41). 
As illustrated in Fig. 2F, the increase of the 
WF by the TBMP⁺TFSI⁻ reduced the hole ex-
traction barrier between the perovskite and
the IM radical-doped HTL film. This change 
enables a high QFL splitting in the perov- 
skites under illumination by eliminating the 
interfacial nonradiative recombination loss. 
Thus, this WF modulation by the ionic salt is 
important, as this V_OC improvement made it
possible for the device based on our IM radical 
doping strategy to reach the performance com- 
parable to that based on conventional doping 
(refer to Fig. 1, B and C). 


From these conductivity and energy level 
investigations, the different functions assumed 
by the organic radicals and ionic salts become 
clear. The radicals enhance the conductivity 
and affect the energy levels (both HOMO and 
WF); the ionic salts further optimize the WF and 
contribute to optimal V_OC values in the devices.

The effectiveness of the radicals as a means 
of tuning the conductivity and energy levels can 
be understood with a charge transfer model 
between neutral spiro-OMeTAD and the spiro-
OMeTAD•⁺TFSI⁻ radical. For neutral spiro-
OMeTAD, the Fermi level (E_F) corresponds to
an energy about midway between the lowest un- 
occupied molecular orbital (LUMO) and HOMO 
levels (fig. S20). The low film conductivity in the 
absence of dopant can be understood by poor 
film crystallinity and low charge carrier con- 
centrations, which are determined by thermal 
excitation of charge carriers from HOMO to 
LUMO levels. 

Upon radical doping, spiro-OMeTAD•⁺TFSI⁻
provides additional electronic states (42)
(figs. S22 to S24): specifically, bound hole
states corresponding to singly unoccupied
molecular orbital (SUMO) levels stabilized by
Coulombic attraction from the TFSI⁻ counterion.
Electron transfer from neutral spiro-OMeTAD
(host) to the spiro-OMeTAD•⁺TFSI⁻ radical
complex (dopant) leads to the generation of
free hole polarons and increases the
conductivity (Fig. 3A). Concurrently, the hole filling
of the transport states downshifts the Fermi 
level and increases the WF (Fig. 2D) (43, 44). 


The mechanism of ionic salts modulating 
spiro-OMeTAD doping 
We now turn to a discussion of how ionic 
salts can further tune the WF. We did not ob- 
serve oxidation or reduction between ionic 
salts and neutral spiro-OMeTAD/spiro-
OMeTAD•⁺TFSI⁻ (figs. S25 and S26). A straight-
forward possibility for inducing the WF change
is the introduction of a macroscopic dipole, 
which could be induced either by a particular 
spiro-OMeTAD orientation (45) or by an in-
terfacial TBMP⁺TFSI⁻ accumulation (46).
We excluded the former on the basis of the 
grazing-incidence wide-angle x-ray scattering 
(GIWAXS) results (fig. S27), which indicated nei- 
ther stacking mode change nor crystallization 
amelioration upon addition of TBMP⁺TFSI⁻.
The latter could be ruled out because time- 
of-flight secondary ion mass spectroscopy 
(TOF-SIMS) measurements showed a uni- 
form distribution of S’, C4Hg , C;H,N , and CF; 
across the entire HTL (fig. S28), indicating 
the absence of surface dipoles formed by 
TBMP⁺TFSI⁻. In addition, the incorporation
of ionic salts increased the WF more than its 
effects on the IE (Fig. 2D), further ruling out 
the possibility of the surface-dipole effect. 
The exclusion of macroscopic dipole forma-
tion motivated us to understand the WF tuning

Fig. 3. Molecular-level doping mechanisms of the IM radical doping. (A) Illustration
of the charge transfer and doping mechanisms with spiro-OMeTAD•⁺TFSI⁻
radicals. (B) ¹H hr-NMR spectra of spiro-OMeTAD (black line), spiro-OMeTAD/
spiro-OMeTAD•⁺TFSI⁻ mixture without TBMP⁺TFSI⁻ (red line; inset shows
magnified view of d+e+f peak in the range of 9 to ~7.5 ppm), and spiro-OMeTAD/
spiro-OMeTAD•⁺TFSI⁻ mixture with TBMP⁺TFSI⁻ (blue line) in the range 9 to
6.25 ppm. (Peaks b, c, and d+e+f refer to the aromatic protons signal.) (C) EPR
signals of doped spiro-OMeTAD (14 mol % radicals) with and without TBMP⁺TFSI⁻
at low temperature and room temperature. (D) Zoom-in spectra near the
valence band of neutral spiro-OMeTAD, 14 mol % radical-doped spiro-OMeTAD
film, and 14 mol % radical and 20 mol % TBMP⁺TFSI⁻-doped spiro-OMeTAD
films. (E) Illustration of ionic salts' effect on WF modulation in the framework of
the IM radical doping strategy. (F) Temperature-dependent conductivity evolution
of doped spiro-OMeTAD (14 mol % radicals) with and without TBMP⁺TFSI⁻.


29 JULY 2022 + VOL 377ISSUE 6605 499 


RESEARCH | RESEARCH ARTICLES 


[Fig. 4, panels A and B. (A) J-V curves; x axis: Voltage (V); legend entries include MAPbI₃, Two-step method, and FAPbI₃ with IL. (B) Energy levels (eV) of films doped with ionic salts, including EDMPA⁺TFSI⁻ and PMPIm.]


mechanism at the molecular level. Both radi- 
cals and salts are ionic, so Coulomb interactions 
among them are to be expected and would 
affect their frontier orbital energetics. Details 
on the Coulomb interactions between the ionic 
salts and radicals were obtained from both 
high-resolution liquid and solid-state nuclear 
magnetic resonance (hr- and ss-NMR, respec- 
tively) spectroscopies (Fig. 3B). Although NMR 
spectroscopy cannot provide direct informa- 
tion on radicals given the strong interactions 
of unpaired electrons with NMR active nuclei, 
comparison between different samples provided 
useful information on the interactions in our 
mixture. We observed electron transfer between
spiro-OMeTAD•⁺TFSI⁻ and spiro-OMeTAD, in
that all ¹H signals (labeled a to f) of the
spiro-OMeTAD hr-NMR spectra were broadened
after the addition of spiro-OMeTAD•⁺TFSI⁻ (Fig.
3B and fig. S29). The broad signal half-widths
suggested that spiro-OMeTAD•⁺TFSI⁻ and neutral
spiro-OMeTAD were near each other and
mutually exchanging electrons, which could
be attributed to π-π stacking between fluorene
moieties of spiro-OMeTAD and
spiro-OMeTAD•⁺TFSI⁻ (47, 48). However, the ¹H
NMR signals of spiro-OMeTAD d+e+f were
more broadened while the b and c peaks
were only slightly affected, indicating that the
electron exchange between spiro-OMeTAD
and spiro-OMeTAD•⁺TFSI⁻ influenced the
molecules only partially on the fluorene part (49).

Upon addition of TBMP⁺TFSI⁻ into the
spiro-OMeTAD and spiro-OMeTAD•⁺TFSI⁻ mixture,
the ¹H NMR signals from all aromatic protons
(b, c, and d+e+f) of spiro-OMeTAD nearly
disappeared (Fig. 3B). This indicated more
effective electron exchange between spiro-OMeTAD
and spiro-OMeTAD•⁺TFSI⁻ in the presence of the
TBMP⁺TFSI⁻ salt. Because direct interactions
between TBMP⁺TFSI⁻ and spiro-OMeTAD were
negligible (fig. S29A), we attributed the effect of
TBMP⁺TFSI⁻ on the ¹H hr-NMR signals (Fig.
3B) to Coulomb interactions between
spiro-OMeTAD•⁺TFSI⁻ and TBMP⁺TFSI⁻, which
enhanced charge transfer between spiro-OMeTAD
and spiro-OMeTAD•⁺TFSI⁻. The emergence of
these interactions is consistent with the shift and
narrowing of the ¹⁹F ss-NMR peak of TFSI⁻ (fig. S30).

A complementary electron paramagnetic
resonance (EPR) study from 77 K to room
temperature demonstrated a Lorentzian
broadening of EPR spectra in our radical-doped
spiro-OMeTAD (for samples both with and
without TBMP⁺TFSI⁻) (Fig. 3C and figs. S31
and S32). We attributed this effect to the
dynamic broadening due to the electron
self-exchange between the neutral spiro-OMeTAD
and spiro-OMeTAD•⁺TFSI⁻ radical. Within the
intermediate exchange rate limit, the EPR
broadening upon TBMP⁺TFSI⁻ incorporation
was attributed to the increased electron transfer
rate between the neutral spiro-OMeTAD and


Fig. 4. Generality of the IM radical doping strategy. (A) J-V
curves of different PSCs based on spiro-OMeTAD HTL following the
IM radical doping strategy. (B) Molecular structures of six
additional ionic salts used in this work and their effects on
the energetic levels of the doped spiro-OMeTAD films. EDMPA,
ethyldimethylpropylammonium; PMPIm, 3-methyl-1-propylpyridinium;
BMIM, 1-propyl-3-methyl-imidazolium. (C) J-V curves of PSCs based on
the spiro-OMeTAD HTL doped with six additional ionic salts
(14 mol % radicals).

spiro-OMeTAD•⁺TFSI⁻ radical that decreased
the spin lifetime (50).

We further investigated the impact of these
Coulomb interactions on the frontier orbital
energies by combining the photoelectron
spectroscopy data and DFT results. The
incorporation of TBMP⁺TFSI⁻ not only decreased the
energy offset between the HOMO level and
the Fermi level but also changed the density
of states (DOS) of frontier orbitals (Fig. 3D).
In addition, low-energy inverse photoelectron
spectra (LEIPS) indicated that the SUMO level
of spiro-OMeTAD•⁺ downshifted from -4.06
to -4.30 eV upon addition of TBMP⁺TFSI⁻
(fig. S33). The DFT calculations highlighted the
effect of the ionic salt on the molecular orbital
levels (figs. S34 and S35), which we rationalized
by considering that the TBMP⁺TFSI⁻ ionic salt
screened the Coulomb attraction between the
counterion and hole polaron in the
spiro-OMeTAD•⁺TFSI⁻ complex (51). In this context,
the impact of the TBMP⁺TFSI⁻ can be viewed
as a modulation of the activation energy of
doping, which is related to the energy difference
between the doping states and transport states
(Fig. 3E), because neutral spiro-OMeTAD
molecules close to TFSI⁻ anions from the ionic
salts can homogenize the electrostatic potential
of the transport states, resulting in a decreased
activation energy of doping and an increased WF.

Given this understanding of the impact of
ionic salts on the energetic levels, we investigated
why there was no conductivity enhancement
with increasing WF, a feature that makes it
possible to decouple conductivity and WF tuning.
Decreasing the activation energy of doping upon
ionic salt addition (Fig. 3E) enabled an increasing
fraction of hole polarons to be thermally populated.
This effect, taken alone, would lead to a
conductivity increase. However, the
temperature-dependent conductivity indicated that the ionic
salts also increase disorder. The
temperature-dependent conductivity measurements revealed
a faster decrease of conductivity upon
addition of TBMP⁺TFSI⁻ and suggested an increase
in the long-distance intermolecular charge
hopping potential barrier from 0.14 eV
to 0.20 eV (52, 53) (Fig. 3F and fig. S36). This
more difficult long-distance intermolecular
charge transfer implied a decreased mobility
that might be induced by increased disorder or
more scattering from ionic clusters (54, 55).
Overall, the ionic salts played a dual role:
They increased the free hole population but
also increased disorder. These two effects largely
compensated each other and led to negligible
conductivity changes over a broad range of
ionic salt concentrations.
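
To get a feel for what a change in hopping barrier from 0.14 eV to 0.20 eV means, the sketch below assumes a simple Arrhenius form, σ(T) = σ₀ exp(−E/k_BT). This functional form is our assumption for illustration only; the text does not state which transport model was fitted. The two temperatures (200 K and 300 K) are likewise hypothetical and chosen only to show the trend.

```python
import math

K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_ratio(barrier_ev: float, t_low_k: float, t_high_k: float) -> float:
    """sigma(t_low) / sigma(t_high) for sigma(T) = sigma0 * exp(-E / (k_B * T))."""
    if barrier_ev < 0 or t_low_k <= 0 or t_high_k <= 0:
        raise ValueError("barrier must be non-negative and temperatures positive")
    return math.exp(-barrier_ev / K_B_EV_PER_K * (1.0 / t_low_k - 1.0 / t_high_k))


# Barriers quoted in the text; temperatures are hypothetical illustration points.
ratio_without_salt = arrhenius_ratio(0.14, t_low_k=200.0, t_high_k=300.0)
ratio_with_salt = arrhenius_ratio(0.20, t_low_k=200.0, t_high_k=300.0)

print(f"fraction of 300 K conductivity left at 200 K, 0.14 eV: {ratio_without_salt:.3f}")
print(f"fraction of 300 K conductivity left at 200 K, 0.20 eV: {ratio_with_salt:.3f}")
```

In this simplified picture the higher barrier makes the conductivity fall off more steeply on cooling, which is the qualitative behavior described above. The prefactor σ₀ cancels in the ratio, so the sketch says nothing about absolute conductivities.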


The generality of the IM radical 
doping strategy 
Stimulated by the possibilities of precise 
energetic manipulation and facile conduc- 
tivity tunability, we applied the IM radical 
doping approach to perovskites with differ- 
ent compositions and devices with different 
architectures (planar or mesoporous struc-
tures). In addition to the PSCs based on SnO₂/
(FAPbI₃)₀.₉₉(MAPbBr₃)₀.₀₁ and TiO₂/FAPbI₃
shown in Fig. 1, B and C, IM radical-doped
spiro-OMeTAD HTLs also resulted in high 
PCEs in perovskites with different band gaps, 
different fabrication methods (one-step or two- 
step), and with or without ionic liquid additives. 
All PCEs were similar to (or slightly higher 
than) those based on conventionally doped 
spiro-OMeTAD HTL (Fig. 4A, fig. S37, and 
tables S4 and S5), indicating that our IM ra- 
dical doping strategy is applicable to different 
PSCs. In addition, we observed improved stab- 
ility (both under high RH and high tempe- 
rature) for all the PSCs using IM radical-doped 
spiro-OMeTAD compared with those using 
conventionally doped spiro-OMeTAD (fig. S38). 
The generality of the IM radical doping 
strategy was also demonstrated in terms of the 
ionic salts. We investigated six additional ionic 
salts, based on typical anions [bis(trifluorome- 
thylsulfonyl)imide, hexafluorophosphate, tet- 
rafluoroborate, p-toluenesulfonate] and cations 
(ethyldimethylpropylammonium, propylpyridi- 
nium, imidazolium, methylpyridinium) (Fig. 4B). 
All of the ionic salts could effectively tune the 
WF (Fig. 4B and fig. S39) without negatively 
affecting the conductivity (fig. S40). Their in-
corporation resulted in high V_OC values (~1.12 to
1.16 V) and high PCEs (~22% to 23%) in SnO₂/
(FAPbI₃)₀.₉₉(MAPbBr₃)₀.₀₁-based PSCs (Fig. 4C
and table S6). Although the use of different 
cations resulted in negligible WF modifications, 
we noted that switching from TFSI⁻ to other
anions could slightly decrease the WF, which 
pointed to the anions playing a more promi- 
nent role in the energetic modulation in the 
framework of the IM radical doping strategy. 
A possible reason is that the anions coming 
from the ionic salts can exchange with TFSI⁻
in the radical complex, resulting in new radical 
complex species with different energetics. 


Discussion 


Because the energetics of the hole polarons 
are sensitive to their local ionic environment, 
the IM radical doping strategy provides room 
to manipulate the WF as a function of the 
nature of the ionic salts. WF tuning could be 
achieved over a large range by controlling the 
interaction strength between the ionic salts 
and the radical complex and is more easily ad- 
justable than WF modulation through inter- 
facial molecular dipoles (46). As such, the IM 
radical doping strategy fills the gap between 
several previous strategies to enhance WF and 
conductivity and greatly expands the toolbox 
for organic semiconductor doping (56, 57).


CANCER


Mechanism-based design of agents that selectively 
target drug-resistant glioma 


Kingson Lin, Susan E. Gueble, Ranjini K. Sundaram, Eric D. Huseman,
Ranjit S. Bindra, Seth B. Herzon


Approximately half of glioblastoma and more than two-thirds of grade II and III glioma tumors lack 
the DNA repair protein O⁶-methylguanine methyltransferase (MGMT). MGMT-deficient tumors
respond initially to the DNA methylation agent temozolomide (TMZ) but frequently acquire resistance through 
loss of the mismatch repair (MMR) pathway. We report the development of agents that overcome this 
resistance mechanism by inducing MMR-independent cell killing selectively in MGMT-silenced tumors. These 
agents deposit a dynamic DNA lesion that can be reversed by MGMT but slowly evolves into an interstrand 
cross-link in MGMT-deficient settings, resulting in MMR-independent cell death with low toxicity in vitro and 
in vivo. This discovery may lead to new treatments for gliomas and may represent a new paradigm for designing 
chemotherapeutics that exploit specific DNA repair defects. 


Genetic instability is a hallmark of cancer
and typically arises from mutations in
key DNA damage repair and/or reversal
proteins [collectively referred to here as
the DNA damage response (DDR)] (1).
Intrinsic DDR defects can be exploited with 
DDR inhibitors through the concept of syn- 
thetic lethality, which is defined as a loss of 
viability resulting from the disruption of two 
genes or pathways that, if disrupted individ- 
ually, are nonlethal (2, 3). Notable examples 
of synthetic lethal interactions between 
tumor-associated DDR defects and DDR in- 
hibitors include (i) homologous recombina- 
tion (HR)-defective tumors and inhibitors of 
polymerase θ (Pol θ) (4); (ii) ataxia-telangiectasia
mutated (ATM)-mutant tumors and ataxia 
telangiectasia and Rad3-related (ATR) inhib- 
itors (5); and (iii) mismatch repair (MMR)- 
deficient tumors and inactivation of Werner 
syndrome helicase (WRN) (6). HR-defective 
tumors also can be selectively targeted by 
inhibitors of poly(ADP-ribose) polymerase 
(PARP) (2, 3). However, the mechanistic basis 
for this toxicity appears to depend on trapping 
of PARP at sites of DNA damage, which likely 
induces more complex damage that requires 
HR for resolution (7). In each of these exam- 
ples, selective tumor cell killing by means of 
the DDR protein inhibitor relies on either the 
induction or persistence of DNA damage or 
aberrant DNA structures. 
These findings suggested to us that genotox-
ins could be tailored to exploit differential
sensitivities that arise from specific tumor-
associated DDR defects. This approach avoids
the need to engage DDR proteins directly, cir- 
cumventing resistance mechanisms that arise 
from mutations in the ligand binding site (8) 
while minimizing off-target effects in healthy, 
DDR-proficient cells. To achieve this, we con- 
ceived a mechanistic strategy in which a single 
agent modifies DNA through two successive 
chemical steps (Fig. 1A). The first chemical re- 
action is designed to generate a primary DNA 
lesion that is rapidly removed by healthy, DDR- 
proficient cells. The second chemical reaction 
is engineered to slowly transform the primary 
modification into a more toxic secondary lesion. 
We anticipated that if the rate of primary lesion 
repair were sufficiently rapid in healthy cells, 
the secondary lesion would accumulate only 
in the DNA of DDR-deficient tumor cells. This 
two-step pathway would overcome resistance 
mechanisms that mitigate the toxicity of the 
primary lesion, which have been implicated in 
resistance to various chemotherapies, includ-
ing anthracyclines [impairment of nucleotide
excision repair (NER)] (9), topoisomerase in-
hibitors (loss of nonhomologous end join-
ing repair) (10), and antimetabolite (11) and
platinum (12) agents (mutations in MMR). Fur-
thermore, delivering these bespoke lesions
by using established chemotherapy scaffolds 
could facilitate rapid translation into the clinic, 
owing to decades of use in cancer patients. We 
examine this strategy in the context of glio- 
blastoma (GBM). 

GBM is the most common and devastating 
form of brain cancer, with a 5-year survival 
rate of ~5%. Approximately half of GBMs lack 
the DDR protein O⁶-methylguanine
methyltransferase (MGMT) through promoter
hypermethylation. MGMT silencing occurs at an even
greater frequency in grade II and III gliomas
(more than 70% of cases), and these tumors
are also largely incurable (13). Mechanistically,
MGMT removes O⁶-alkylguanine adducts by
transferring the alkyl adduct to an active site
cysteine through an S_N2 displacement.
Patients with MGMT-deficient tumors (referred
to here as MGMT- tumors) are treated with
temozolomide (TMZ; 1a), a prodrug that
converts under physiological conditions to the
potent methylating agent methyl diazonium (1c)
through the intermediacy of
3-methyl-(triazen-1-yl)-imidazole-4-carboxamide (MTIC; 1b) (Fig.
1B). N7-Methylguanine and N3-methyladenine
are the major sites of methylation (70 and 9%,
respectively) but are readily resolved by the
base excision repair (BER) pathway. By
contrast, although O⁶-methylguanine (O⁶MeG;
3) adducts derived from TMZ (1a) only
compose ~5% of alkylation products, these lesions
persist in the genome of MGMT- tumors [but
are readily reversed by healthy (MGMT+)
cells] (Fig. 1C). O⁶MeG (3) lesions are thought
to induce formation of DNA double-strand
breaks (DSBs) and tumor cell death through
an MMR-dependent mechanism (14). MGMT
status is a predictive biomarker for initial
response to TMZ (1a) in GBM, with a significant
overall survival benefit in the up-front
setting for patients with these cancers (15).
However, it is now well established (16) that
acquired clinical resistance to TMZ (1a) through 
MMR mutations abrogates its efficacy, lead- 
ing to recurrent GBM and death in nearly 
all patients. Frequently, TMZ (1a) is used as 
adjuvant therapy for grade III and high-risk 
grade II gliomas; however, it remains non- 
curative, with recurrences typically occur- 
ring over 2 to 10 years. In ~80% of patients, 
recurrences coincide with transformation 
to higher-grade tumors resistant to TMZ (1a), 
associated in ~50% of cases with acquisition of 
a distinct hypermutation signature secondary 
to MMR deficiency, resulting in reduced sur- 
vival (17, 18). 

We hypothesized that we could deploy the 
strategy outlined in Fig. 1A to develop agents 
that overcome the resistance associated with 
MMR loss while maintaining TMZ’s (1a) se- 
lectivity for MGMT-silenced tumors. These 
agents would deposit a primary lesion suscep-
tible to S_N2-mediated removal by MGMT that
could undergo a further chemical transfor- 
mation to a secondary lesion capable of killing 
MGMT-deficient tumor cells in an MMR- 
independent manner. To maintain the ther- 
apeutic index (TI) between MGMT- tumor 
cells and MGMT+ healthy cells, the primary 
lesion must undergo MGMT-mediated repair 
faster than it undergoes transformation to the 
secondary lesion. With these considerations
in mind, we developed an agent capable of
depositing a 2-fluoroethyl lesion at O⁶-guanine.
We expected that O⁶FEtG (5) would evolve
slowly to the (N1,O⁶Et)G intermediate 6, which
could then undergo ring-opening by N3 of the
complementary cytidine base (7) to form the
G(N1)-C(N3) interstrand cross-link (ICL) 8
(Fig. 1E). This hypothesis is based on the

Fig. 1. Overview of mechanistic strategy and structures of agents used
in this study. (A) Underlying mechanistic design. Systemic administration of
a bifunctional agent is envisioned to form a primary lesion that is rapidly
resolved by healthy (DDR+) but not DDR-deficient (DDR-) cells. The
persistence of the primary lesion allows it to evolve slowly to a more toxic
secondary lesion. (B) Under physiological conditions, TMZ (1a) converts
to MTIC (1b), which in turn decomposes to methyl diazonium (1c).
(C) O⁶-Methylguanine (3) is the most clinically significant alkyl nucleobase
generated by methyl diazonium (1c) and is rapidly reverted to dG (2)
by MGMT. (D) Conversion of KL-50 (4a) to KL-85 (4b) in aqueous
solution followed by decomposition to 2-fluoroethyl diazonium ion (4c).
(E) 2-Fluoroethyl diazonium ion (4c), derived from KL-50 (4a), is proposed
to lead to the formation of G(N1)-C(N3)Et ICL 8 specifically in MGMT- cells.
This mechanism is independent of MMR status. (F) Structures of the
triazenes 9 to 13, mitozolomide 12a, and lomustine (CCNU; 14). Syntheses
are available in the supplementary materials.


finding that the corresponding nucleoside,
O⁶-(2-fluoroethyl)guanosine (S1), under-
goes a hydrolytic rearrangement to N1-(2-
hydroxyethyl)guanosine (S3) with a half-life
of 18.5 hours (37°C, pH 7.4) (fig. S1A) (19) and
the recent determination that oligonucleo-
tides containing a single photocaged O⁶-(2-
chloroethyl)guanosine residue can undergo
G-C cross-linking through a parallel mech-
anistic pathway (20). Although the rates of
reversal of O⁶FEtG (5) by MGMT are unknown,
MGMT reacts rapidly with methylated calf
thymus DNA (second-order rate constant =
1 × 10⁹ M⁻¹ min⁻¹) (21); and oligonucleotides
that contain O⁶-methyl-, O⁶-benzyl-, and O⁶-
heteroarylmethylguanine residues possess
nanomolar median inhibitory concentrations
(IC₅₀) against MGMT in vitro (22). Additionally,
MGMT reverses a wide range of O⁶-alkylguanine
substrates (23). These data suggested that
MGMT-proficient cells should repair the O⁶FEtG
lesion (5) before it transforms into ICL 8.
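
The design argument amounts to a competition between two first-order processes acting on the same primary lesion: repair by MGMT and slow evolution toward the cross-link. A minimal sketch of that competition is given below. It uses the 18.5-hour half-life reported for the free nucleoside as a stand-in for the lesion in duplex DNA, which is an assumption, and the repair half-life of 0.5 hours for MGMT-proficient cells is purely hypothetical, since the text states that the actual reversal rates are unknown. Function names are ours.

```python
import math


def rate_from_half_life(t_half_hours: float) -> float:
    """First-order rate constant (per hour) from a half-life in hours."""
    if t_half_hours <= 0:
        raise ValueError("half-life must be positive")
    return math.log(2) / t_half_hours


def fraction_to_secondary(k_evolve: float, k_repair: float) -> float:
    """Fraction of primary lesions that evolve before being repaired,
    for two competing first-order pathways."""
    return k_evolve / (k_evolve + k_repair)


k_evolve = rate_from_half_life(18.5)            # nucleoside half-life from the text
k_repair_proficient = rate_from_half_life(0.5)  # hypothetical MGMT+ repair half-life
k_repair_deficient = 0.0                        # no MGMT-mediated repair

frac_proficient = fraction_to_secondary(k_evolve, k_repair_proficient)
frac_deficient = fraction_to_secondary(k_evolve, k_repair_deficient)

print(f"MGMT+ cells: {frac_proficient:.3f} of lesions evolve")
print(f"MGMT- cells: {frac_deficient:.3f} of lesions evolve")
```

With these illustrative inputs only a few percent of primary lesions would evolve in repair-proficient cells, whereas all of them would in the absence of repair. The model ignores other repair pathways, cell division, and dilution, so it shows the direction of the effect rather than its magnitude.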
Results

Compound synthesis and in vitro evaluation 

We synthesized the imidazotetrazine 4a (also 
known as KL-50) and the triazene 4b (also known 
as KL-85) as vehicles to deliver 2-fluoroethyl 
diazonium (4c) as well as a series of related
agents to probe structure-activity relation- 
ships in tissue culture (Fig. 1, D and F). The 
2-fluoropropyl- and 3-fluoropropyl-triazenes 
(10 and 11, respectively) were prepared through 
the diazotization of 4-aminoimidazole-5- 
carboxamide, followed by the addition of the 
respective amine (supplementary materials). 
All other triazenes were prepared according to 
literature procedures (supplementary mate- 
rials). We evaluated the cytotoxicity of our 
compounds in short-term cell viability assays 
against four isogenic LN229 GBM cell lines 
engineered to be proficient or deficient in 
MGMT and/or MMR activity by using short 
hairpin RNAs (shRNAs) targeting MSH2 (re-
ferred to as MGMT+/-, MMR+/- cells here-
after) (fig. S2A). This approach allowed us to
interrogate the relationship between MGMT 
and MMR status and compound activity. 
The IC₅₀ values of these agents are shown
in Fig. 2A, and representative dose-response
curves are shown in Fig. 2B (additional data
are presented in fig. S2, B to G). KL-85 (4b)
retained potency in MGMT-/MMR- cells (IC₅₀ =
27.5 µM), whereas TMZ (1a) was essentially
inactive (IC₅₀ = 838 µM). Structure-activity
studies were consistent with the mechanistic 
pathway shown in Fig. 1E. The 2,2-difluoroethyl 
triazene 9 and the 2-fluoropropyl triazene 10 
possessed reduced potency in MGMT-/MMR- 
cells (fig. S2, B and C), which is in agreement 
with a reduced rate of displacement after 
introduction of an additional fluorine (24) or 
alkyl substituent. The 2-chloroethyl triazene 
12b was modestly potent but not as selective 
for MGMT- cell lines (fig. S2E), which likely 
results from faster, nonselective ICL forma-
tion arising from chloride displacement. The
3-fluoropropyl triazene 11 was essentially in-
active in all four cell lines, potentially because
of elimination of the diazonium ion (fig. S2D). 
As expected, the ethyl triazene 13 was inactive 
(fig. S2F). This compound is known to rapidly
undergo elimination to ethylene gas after con-
version to ethyl diazonium (25).

We prepared KL-50 (4a) through the diazo-
tization of 4-aminoimidazole-5-carboxamide
followed by the addition of (2-fluoroethyl)
isocyanate (39% overall yield) (supplementary
materials). The potency of 4a mirrored that of 
4b in the four cell lines examined (Fig. 2B). To 
benchmark selectivity, we evaluated the exper- 
imental agent mitozolomide (MTZ; 12a) and 

Fig. 2. KL-50 (4a) displays previously unseen MGMT-dependent, MMR-independent
cytotoxicity in multiple isogenic cell models. (A) IC50 values
derived from short-term viability assays in LN229 MGMT+/-, MMR+/- cells treated
with TMZ (1a) derivatives. *MGMT TI = IC50 (MGMT+/MMR+) divided by IC50
(MGMT-/MMR+). †MMR resistance index (RI) = IC50 (MGMT-/MMR-) divided by IC50
(MGMT-/MMR+). (B) Short-term viability assay curves for TMZ (1a), CCNU (14),
KL-85 (4b), and KL-50 (4a) in LN229 MGMT+/-, MMR+/- cells. (C) Clonogenic
survival curves for TMZ (1a) in LN229 MGMT+/-, MMR+/- cells, with representative
images of wells containing 1000 plated cells treated with 30 µM TMZ (1a).
(D) Clonogenic survival curves for KL-50 (4a) in LN229 MGMT+/-, MMR+/- cells,
with representative images of wells containing 1000 plated cells treated with 30 µM
KL-50 (4a). (E) Short-term viability assay curves for TMZ (1a) in DLD1 MSH6-deficient
cells pretreated with 0.01% dimethyl sulfoxide (DMSO) control (CTR) or 10 µM O6BG
(+O6BG) for 1 hour before TMZ (1a) addition to deplete MGMT. (F) Short-term viability
assay curves for KL-50 (4a) in DLD1 MSH6-deficient cells pretreated with 0.01%
DMSO control (CTR) or 10 µM O6BG (+O6BG) for 1 hour before KL-50 (4a) addition.
(G) Short-term viability assay curves for TMZ (1a) in HCT116 MLH1-/- cells or
HCT116 cells complemented with chromosome 3 carrying wild-type MLH1 (+Chr3)
pretreated with 0.01% DMSO control or 10 µM O6BG (+O6BG) for 1 hour before
TMZ (1a) addition. (H) Short-term viability assay curves for KL-50 (4a) in HCT116
MLH1-/- cells or HCT116 cells complemented with chromosome 3 carrying wild-type
MLH1 (+Chr3) pretreated with 0.01% DMSO control or 10 µM O6BG (+O6BG) for
1 hour before KL-50 (4a) addition. For (B) to (H), points indicate the mean, and error
bars indicate SD; n = 3 technical replicates.

the clinical nitrosourea lomustine (also known
as CCNU; 14), which have been studied with 
hopes of addressing TMZ (1a) resistance (26, 27). 
However, these 2-chloroethylating agents were 
only approximately four- to sevenfold selective 
for MGMT-deficient cells (fig. S2G and Fig. 2B, 
respectively), as opposed to the ~25-fold selec- 
tivity seen with the 2-fluoroethylating agent 
KL-50 (4a). 
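
The fold-selectivity figures quoted above are ratios of IC50 values, following the definitions of the MGMT therapeutic index (TI) and MMR resistance index (RI) given in the Fig. 2 legend. The following is a minimal sketch of that arithmetic. The IC50 values in it are hypothetical placeholders chosen only to illustrate the calculation; they are not measurements from this study, and the function name `fold_ratio` is likewise an assumption.

```python
def fold_ratio(ic50_numerator_um: float, ic50_denominator_um: float) -> float:
    """Return the ratio of two IC50 values (both in micromolar)."""
    if ic50_denominator_um <= 0:
        raise ValueError("IC50 in the denominator must be positive")
    return ic50_numerator_um / ic50_denominator_um


# Hypothetical IC50 values (micromolar) for one compound in the four
# isogenic backgrounds; NOT data from the paper.
ic50 = {
    "MGMT+/MMR+": 500.0,
    "MGMT-/MMR+": 20.0,
    "MGMT-/MMR-": 22.0,
}

# MGMT TI = IC50 (MGMT+/MMR+) divided by IC50 (MGMT-/MMR+)
mgmt_ti = fold_ratio(ic50["MGMT+/MMR+"], ic50["MGMT-/MMR+"])

# MMR RI = IC50 (MGMT-/MMR-) divided by IC50 (MGMT-/MMR+)
mmr_ri = fold_ratio(ic50["MGMT-/MMR-"], ic50["MGMT-/MMR+"])

print(f"MGMT TI = {mgmt_ti:.1f}-fold, MMR RI = {mmr_ri:.2f}")
```

With these placeholder inputs, a TI near 25 together with an RI near 1 would correspond to the profile described for KL-50 (4a): strong dependence on MGMT status and little dependence on MMR status.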

We validated the antitumor activity of KL-50 
(4a) in clonogenic survival assays (CSAs) and 
additional cell lines in vitro. TMZ (1a) pos- 
sessed negligible activity in MGMT+ LN229 
cells, irrespective of MMR status, and induced 
robust tumor cell killing in MGMT-, MMR+ 
cells that was abolished in isogenic cells lack- 
ing MMR (Fig. 2C). Lomustine (14) was ef- 
fective in MMR- cells but was cytotoxic to 
MGMT+ cells (fig. S2H). By contrast, KL-50
(4a) demonstrated robust antitumor activity 
in MGMT- cells, independent of MMR status, 
with negligible toxicity in MGMT+ cells at
doses of up to at least 200 uM (Fig. 2D). We 
observed a similar pattern of activities in several 
distinct cell lines across different tumor types 
with intrinsic or induced loss of MGMT and/ 
or MMR activity. For example, TMZ (1a) was 
inactive in DLD1 cells, which possess MGMT 
but lack functional MMR (MSH6-), with or 
without induced depletion of MGMT by using 
O6-benzylguanine (O6BG) (Fig. 2E) (28). By
contrast, KL-50 (4a) was efficacious in these
cells, but only after O6BG-induced MGMT
depletion (Fig. 2F). TMZ (1a) was inactive in
HCT116 colorectal cancer cells, which lack the 
MMR protein MLH1, regardless of MGMT 
levels (Fig. 2G). Restoration of MMR activity 
by means of complementation with chromo- 
some 3 containing MLH1 resulted in the ex- 
pected enhanced sensitivity to TMZ (1a), which 
was further potentiated by MGMT depletion 
(Fig. 2G). By contrast, KL-50 (4a) induced selective
tumor cell killing specifically in the setting
of O6BG-induced MGMT suppression in
both MLH1-deficient and MLH1-complemented
cells (Fig. 2H). The MMR status and O6BG-induced
loss of MGMT expression were confirmed
with Western blot analysis for these
cell lines (fig. S2, I and J). We also confirmed 
the activity of KL-50 (4a) in MGMT- LN229 
cells engineered to lack expression of other 
key MMR proteins, including MSH6, MLH1, 
PMS2, and MSH3 (fig. S3). The inability of
MSH3 depletion to induce TMZ (1a) resis- 
tance is an anticipated result, given that the 
primary role of the MSH2-MSH3 heterodimer 
is recognition of insertion-deletion loops, 
whereas single base pair mismatches induced 
by TMZ are processed by the MSH2-MSH6 
heterodimer and subsequent downstream MMR 
proteins (29, 30). Last, we compared the cyto- 
toxicity of KL-50 (4a) and TMZ (1a) in normal 
human fibroblast cells and observed no in- 
crease in toxicity with KL-50 (4a) (fig. S2K). 
These data suggest that KL-50 (4a) is a first-in-class
molecule that overcomes MMR mutation-induced
resistance while retaining selectivity
for tumor cells that lack MGMT. 


Mechanism of action of KL-50 (4a) 


We used a well-established comet assay ad- 
apted for ICL detection to determine whether 
ICLs were formed in MGMT- cells treated 
with KL-50 (4a) (Fig. 3, A and B). In this assay, 
cells were sequentially exposed to genotoxins 
and ionizing radiation (IR) and then analyzed 
by means of single-cell alkaline gel electrophoresis.
Attenuation of the IR-induced comet
tail is indicative of ICL formation. As expected,
in the absence of IR, TMZ (1a) (200 µM) and
KL-50 (4a) (200 µM) both induced tailing in
MGMT-/MMR+ cells, whereas mitomycin C
(MMC) (0.1 or 50 µM) did not (31). Exposure to
50 uM MMC for 2 hours abolished the IR- 
induced comet tail, whereas exposure to 0.1 uM 
MMC [chosen to be ~10-fold greater than the 
IC50 for this drug, which is comparable with
200 uM KL-50 (4a) or TMZ (1a)] for 24 hours 
caused a partial reduction in the IR-induced 
comet tail. TMZ (1a) (200 uM) did not reduce 
DNA migration after IR, which is in agreement 
with its known function as a monoalkylation 
agent with no known cross-linking activity. By 
contrast, KL-50 (4a) (200 uM) reduced the 
percent of DNA in the tail after IR to levels 
similar to those seen for 0.1 uM MMC. We ob- 
served a similar pattern of comet tail migration 
for MMC and KL-50 (4a) in MGMT-/MMR- 
cells, which supports an MMR-independent 
cross-linking mechanism. Comparable results 
were observed in MGMT-/MMR- cells treated 
with KL-85 (4b) (fig. S4, A and B). 

We carried out this assay at varying time 
points (2 to 24 hours) to assess the rates of ICL 
formation in MGMT-/MMR- cells treated with 
KL-50 (4a), MTZ (12a), or TMZ (1a) (Fig. 3, C 
and D). The chloroethylating agent MTZ (12a) 
reduced DNA mobility within 2 hours, which 
is consistent with the cell line selectivities 
above and literature reports that this agent 
rapidly forms ICLs (32) through chloride dis- 
placement from sites of 2-chloroethylation 
other than O6G. TMZ (1a) did not induce a
statistically significant decrease in DNA migra- 
tion within 24 hours. However, we observed a 
time-dependent decrease in DNA mobility in 
cells treated with KL-50 (4a), with the largest 
difference observed between 8 and 24 hours; 
this is consistent with the reported half-life of
O6-(2-fluoroethyl)guanosine (S1) (18.5 hours)
(fig. S1A) (19) and that of O6FEtG (6) (18.5 hours).
In the unirradiated samples, KL-50 (4a), MTZ 
(12a), and TMZ (1a) all induced maximal dam- 
age at 2 hours; this damage decreased over 
time, which is consistent with progressive DNA 
repair (fig. S4, C and D). Analysis of genomic 
DNA isolated from LN229 MGMT-/MMR+ 
cells treated with KL-50 (4a, 200 µM) or KL-85
(4b, 200 µM) by denaturing gel electrophoresis
(33) demonstrated the presence of cross-linked
DNA (Fig. 3E). TMZ (1a) and MTIC (1b)
showed no evidence of ICL induction. Sim- 
ilarly, linearized pUC19 plasmid DNA treated 
with KL-50 (4a) (100 uM) also possessed ICLs, 
with delayed rates of formation relative to 
those of 12b (Fig. 3F). Collectively, these data 
support a mechanism of action for KL-50 (4a) 
that involves the slow generation of DNA ICLs 
in the absence of MGMT. 
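
As a rough illustration of why cross-links would accumulate mainly between 8 and 24 hours, the sketch below assumes simple first-order loss of the O6-(2-fluoroethyl) adduct with the 18.5-hour half-life reported for the model nucleoside. The first-order assumption and the function name are illustrative and not taken from the study; MGMT repair, cellular uptake, and any steps downstream of the initial displacement are ignored.

```python
import math

HALF_LIFE_H = 18.5  # reported half-life of O6-(2-fluoroethyl)guanosine, in hours


def fraction_converted(t_hours: float, half_life_hours: float = HALF_LIFE_H) -> float:
    """Fraction of adduct consumed after t_hours, assuming first-order decay."""
    rate_constant = math.log(2) / half_life_hours
    return 1.0 - math.exp(-rate_constant * t_hours)


# Time points used in the comet assay time course
for t in (2, 8, 24):
    print(f"{t:>2} h: {fraction_converted(t):.1%} of adduct converted")
```

Under these assumptions roughly 7% of the adduct would have reacted by 2 hours, about 26% by 8 hours, and about 59% by 24 hours, which matches the qualitative observation that most of the change in DNA mobility appears late in the time course.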

To probe for alternative mechanisms of ac- 
tion, we conducted experiments to assess for 
involvement of nucleotide excision repair (NER) 
or BER in lesion processing, generation of re- 
active oxygen species (ROS), and DNA duplex 
destabilization. Short-term cell viability assays 
in isogenic mouse embryonic fibroblasts (MEFs) 
proficient or deficient in XPA, a common shared 
NER factor, revealed no differential sensitivity, 
either with or without O6BG-induced MGMT
depletion (fig. S5A). N7-MeG lesions induced 
by TMZ (1a) are prone to spontaneous de- 
purination, apurinic (AP) site formation, and 
single-strand breaks (SSBs), which are all known 
BER substrates. To probe for potential differ- 
ential induction of BER substrates by KL-50 
(4a) compared with TMZ (1a), we performed 
in vitro supercoiled plasmid DNA assays to 
measure the formation of AP sites (34). We 
observed similar levels of spontaneous and 
enzyme-catalyzed SSBs from AP sites after treat- 
ment with KL-50 (4a) and TMZ (1a), suggest- 
ing comparable levels of depurination (fig. S5B). 
Cotreatment with increasing concentrations 
of the ROS scavenger N-acetylcysteine (NAC) 
did not rescue cell viability (fig. S5C). Melting 
point analysis did not reveal any notable dif- 
ferences in DNA stability resulting from fluo- 
roethylation compared with methylation (fig. 
S5D). These data suggest that NER status, AP 
site induction, ROS, and altered DNA stability 
are peripheral or noncontributory to the effec- 
tiveness of KL-50 (4a). 

We characterized the profile of DDR activa- 
tion across our four isogenic cell lines after 
treatment with KL-50 (4a) or TMZ (1a). Our 
prior finding that the ATR-CHK1 signaling axis 
is activated in response to TMZ (1a)-induced 
replication stress in MGMT-deficient cells (35) 
prompted us to analyze the phosphorylation 
status of CHK1 and CHK2 in LN229 MGMT+/- 
and MMR+/- cells. KL-50 (4a) induced CHK1 
and CHK2 phosphorylation in MGMT- cells 
regardless of MMR status, whereas TMZ (1a) 
only induced phospho-CHK1 and -CHK2 in 
MGMT-/MMR+ cells (fig. S6A). We analyzed
foci formation of the DDR factors phospho-SER139-H2AX
(γH2AX), p53 binding protein
1 (53BP1), and phospho-SER33-RPA2 (pRPA) 
over the period of 2 to 48 hours (Fig. 4, A to D, 
and fig. S6, B and C) (36). KL-50 (4a) induced a 
maximal foci response at 48 hours, specifically 
in MGMT- cells and irrespective of MMR status 
(4a). TMZ (1a) induced a comparable response
in MGMT- cells, but this was abolished in the
absence of functional MMR, which is consistent
with known MMR-silencing-based resistance
(16). We observed a reduced level of foci
formation in MGMT+/MMR+ cells that was absent
in MGMT+/MMR- cells, suggesting an
MMR-dependent DDR in these cells. However,
these foci dissipate at later time points (72 to
96 hours) (fig. S6D) and are not associated with
appreciable cellular toxicity (Fig. 2, C and D).
KL-50 (4a) induced increasing G2 arrest on
progression from 24 to 48 hours in MGMT-/MMR+
cells as determined with simultaneous
analysis of DNA content based on nuclear
(Hoechst) staining in the foci studies above
(Fig. 4E and fig. S7, A and B). As previously
reported for other sources of DNA damage,
KL-50 (4a) induced an attenuated G2 arrest in
MGMT-/MMR- cells, which is consistent with
a role of MMR in the G2-checkpoint (37). This
effect in MGMT-/MMR- cells was absent after
TMZ (1a) treatment. Both TMZ (1a) and KL-50
(4a) induced a moderate G2 arrest in MGMT+/MMR+
cells.

Fig. 3. Unrepaired primary KL-50 (4a) lesions convert to DNA ICLs in the
absence of MGMT. (A) Scatter dot plots of the percent DNA in tail upon
single-cell alkaline gel electrophoresis performed on LN229
MGMT-/MMR- cells treated with 0.2% DMSO control, 200 µM TMZ (1a),
200 µM KL-50 (4a), or 0.1 µM MMC (MMC*) for 24 hours or with 50 µM MMC
(MMC**) for 2 hours. After cell lysis, comet slides were irradiated with 0
or 10 grays (Gy) before alkaline electrophoresis. Lines indicate median; error
bars indicate 95% confidence interval (CI); n = 160 comets per condition.
(B) Representative comet images from (A). (C) Scatter dot plots of the percent
DNA in tail upon single-cell alkaline gel electrophoresis performed on LN229
MGMT-/MMR- cells treated with 0.2% DMSO control, 200 µM MTZ (12a),
200 µM TMZ (1a), or 200 µM KL-50 (4a) for 2, 8, or 24 hours. After cell lysis,
comet slides were irradiated with 10 Gy before alkaline electrophoresis. Lines
indicate median; error bars indicate 95% CI; n = 230 comets per condition. Data
from samples treated with 0 Gy are shown in fig. S4, C and D. (D) Representative
comet images from (C). (E) Denaturing gel electrophoresis of genomic DNA isolated
from LN229 MGMT-/MMR+ cells treated with 0.2% DMSO control, 200 µM KL-50
(4a), 200 µM TMZ (1a), 200 µM KL-85 (4b), or 200 µM MTIC (1b) for
24 hours or with 50 µM MMC or 200 µM CCNU (14) for 2 hours. (F) Denaturing
gel electrophoresis of linearized 00 ng pUC19 plasmid DNA treated in vitro with
cisplatin (36 hours), 100 µM MMS (36 hours), 100 µM of
KL-50 (4a), or 12b for 6 to 36 hours. For (E) and (F), bands indicating cross-linked
DNA are indicated with arrows. Quantification of bands in (F) is

We quantified the levels of DDR foci across 
the individual cell cycle phases (fig. S8). KL-50 

Fig. 4. KL-50 (4a) activates DDR pathways and cycle arrest in MGMT- cells,
independent of MMR, and cells deficient in ICL or HR repair are sensitized to
KL-50 (4a). (A) γH2AX, (B) 53BP1, and (C) pRPA foci formation quantified by percent
of cells with ≥10 foci in LN229 MGMT+/-, MMR+/- cells treated with 0.1% DMSO
control, 20 µM KL-50 (4a), or 20 µM TMZ (1a) for 48 hours. Columns indicate the
mean; error bars indicate SD; n = 5 technical replicates. Additional time course data
are presented in fig. S6, B to D. (D) Representative foci images of data in (A) to (C).
(E) Percentage of cells in G1, S, and G2 cell cycle phases after treatment as in (A)
to (C), measured by using integrated nuclear (Hoechst) staining intensity. Columns
indicate the mean; error bars indicate SD; n = 3 independent analyses. Additional time
course data, cell cycle controls, and representative histograms are presented in fig. S7.
(F) Change in percent cells with ≥1 micronuclei from baseline (DMSO control) after
treatment as in (A) to (C). Columns indicate the mean; error bars indicate SD;
n = 15 technical replicates; ****P < 0.0001; ns, not significant. Additional validation
is presented in fig. S9, A and B. (G) Short-term viability assay curves for KL-50 (4a) in
PD20 cells, deficient in FANCD2 (FANCD2-/-) or complemented with FANCD2
(+FANCD2). (H) Short-term viability assay curves for KL-50 (4a) in PEO4 (BRCA2+)
and PEO1 (BRCA2-/-) cells pretreated with 0.01% DMSO control or 10 µM O6BG
(+O6BG) for 1 hour before KL-50 (4a) addition. (I) Short-term viability assay curves for
KL-50 (4a) in DLD1 BRCA2+/- and BRCA2-/- cells pretreated with 0.01% DMSO
control or 10 µM O6BG (+O6BG) for 1 hour before KL-50 (4a) addition. For (G) to (I),
points indicate the mean, and error bars indicate SD; n = 3 technical replicates.

29 JULY 2022 • VOL 377 ISSUE 6605


RESEARCH | RESEARCH ARTICLES 


(4a) induced foci formation primarily in the S
and G2 phases of the cell cycle, which is consistent
with replication blocking by ICLs (38).
Foci increased in MGMT- G1-phase cells at
48 hours, suggesting that a fraction of cells
may progress through the S phase with unrepaired
DNA damage. TMZ (1a) displayed a
similar pattern of foci induction in the S and
G2 phases, with smaller increases in G1-phase
foci. We concurrently assessed micronuclei
formation as an indication of replication stress-induced
chromosomal instability (Fig. 4F). Consistent
with previously reported data in MSH2-/-
MEFs in which MMR loss suppressed micronuclei
formation arising from replication-stress (39),
MMR-deficient LN229 cells displayed suppression
of micronuclei formation induced by
KL-50 and TMZ in an MGMT+ background.

Fig. 5. KL-50 (4a) appears to be safe and efficacious in both MGMT-/MMR+
and MGMT-/MMR- flank tumors over a wide range of treatment regimens
and conditions. (A) Xenograft LN229 MGMT-/MMR+ flank tumors treated with
three weekly cycles of oral administration of 10% cyclodextrin control (n = 7 mice),
TMZ (1a) (n = 7 mice, 5 mg/kg) or KL-50 (4a) (n = 6 mice, 5 mg/kg) on Monday,
Wednesday, and Friday (individual spider plots in fig. S10A). (B) Xenograft LN229
MGMT-/MMR- flank tumors treated with three weekly cycles of oral administration
of 10% cyclodextrin control (n = 6 mice), TMZ (1a) (n = 5 mice, 5 mg/kg), or
KL-50 (4a) (n = 5 mice, 5 mg/kg) on Monday, Wednesday, and Friday (individual
spider plots in fig. S10B). (C) Mean body weight of mice during LN229 flank tumor
experiments in (A) and (B). (D to F) Xenograft (D) LN229 MGMT-/MMR+ and
(E) LN229 MGMT-/MMR- flank tumors treated with oral administration of 10%
cyclodextrin control (n = 7 mice), KL-50 (4a) (n = 6 mice, three cycles of
15 mg/kg on Monday, Wednesday, and Friday), KL-50 (4a) (n = 6 mice, 1 cycle of
25 mg/kg Monday through Friday), or intraperitoneal administration of KL-50
(4a) (n = 7 mice, three cycles of 5 mg/kg on Monday, Wednesday, and Friday)
revealed equal efficacy with no observable increases in toxicity, (F) as measured
with mice systemic weights (individual spider plots are provided in fig. S10C).
(G) Xenograft LN229 MGMT-/MMR+ and LN229 MGMT-/MSH6- flank tumors
with a larger average starting tumor size of ~400 mm³ and ~350 mm³,
respectively, treated with three weekly cycles of oral administration of 10%
cyclodextrin (n = 4 mice) or KL-50 (4a) (n = 4 mice, 25 mg/kg on Monday,
Wednesday, and Friday). The study period was limited by control groups that had
to be euthanized for exceeding the ethical maximum allowed tumor size, thus
ending the study. In (A) to (G), points indicate the mean, and error bars indicate
SEM; *P < 0.05; **P < 0.01; ***P < 0.001; ns, not significant.

However, KL-50 (4a), but not TMZ (1a), was
able to induce micronuclei in MGMT-/MMR- 
cells, which suggests that the ICLs formed in 
these cells are able to induce chromosomal 
instability even in the absence of MMR. These 
findings are in agreement with the differential 
toxicity profiles of KL-50 (4a) and TMZ (1a):
KL-50 (4a) induces multiple successive markers 
of DNA damage and engagement of the DDR 
in MGMT- cells, independent of MMR status, 
whereas the effects of TMZ (1a) are similar 
to those of KL-50 in MGMT-/MMR+ cells but
absent in MMR- cells. Coupled with the ICL 
kinetics data presented above, these time-course 
data support a slow rate of ICL induction in situ 
by KL-50 (4a). 

These foci data suggest that KL-50 (4a) 
induces replication stress (as indicated by 
pRPA foci formation) and formation of DSBs 
(as indicated by γH2AX and 53BP1 foci),
which are known to follow the formation of 
ICLs (38). Consistent with this, BRCA2- and 
FANCD2-deficient cells are hypersensitive to 
KL-50 (4a) (Fig. 4, G to I, and fig. S9, C to F). 
In two MGMT-proficient cell models, BRCA2 
loss enhanced the toxicity of KL-50 (4a) after 
MGMT depletion through O6BG (Fig. 4, H
and I). We observed FANCD2 ubiquitination 
by KL-50 (4a) specifically in MGMT- cells, 
suggesting activation of the Fanconi anemia 
(FA) ICL repair pathway (fig. S9G). As previously
reported (40), TMZ (1a) also induced
FANCD2 ubiquitination, but only in MGMT-/MMR+
cells. This is an expected result given
the propensity of O6MeG (3) to induce stalled
replication forks through the process of futile 
cycling (41). 


KL-50 (4a) is active against TMZ (1a)-resistant 
tumors in vivo 


We evaluated the activity of KL-50 (4a) and 
TMZ (1a) in vivo using mouse flank tumor 
models derived from the isogenic LN229 
MGMT- cell lines. We treated MGMT-/MMR+ 
and MGMT-/MMR- flank tumors with KL-50 
(4a) or TMZ (1a) (5 mg/kg orally Monday, 
Wednesday, and Friday for 3 weeks) as pre- 
viously described for TMZ (1a) (35). As ex- 
pected, TMZ (1a) suppressed tumor growth in 
the MGMT-/MMR+ tumors (Fig. 5A). KL-50 
(4a) was statistically noninferior to TMZ (1a), 
despite a 17% lower molar dosage owing to 
its higher molecular weight. In the MGMT- 
/MMR- tumors, TMZ (1a) demonstrated no 
efficacy, whereas KL-50 (4a) potently sup- 
pressed tumor growth (Fig. 5B). KL-50 (4a) 
treatment resulted in no statistically signifi- 
cant changes in body weight compared with 
that of TMZ (1a) or control (Fig. 5C). KL-50 
(4a) was effective and nontoxic across different
dosing regimens (5, 15, and 25 mg/kg),
treatment schedules (Monday, Wednesday, 
and Friday for 3 weeks, and Monday through 
Friday for 1 week), and routes of drug admin- 
istration (oral or intraperitoneal) in mice bear- 
ing MGMT-/MMR+ and MGMT-/MMR- flank 
tumors (Fig. 5, D to F). KL-50 (4a) (25 mg/kg 
orally Monday, Wednesday, and Friday for 
3 weeks) potently suppressed the growth of 
large (~350 to 400 mm³) MGMT-/MMR+
and MGMT-/MSH6- tumors (Fig. 5G). KL-50 
(4a) (25 mg/kg orally Monday through Friday 
for 1 week) was also effective in an orthotopic, 
intracranial LN229 MGMT-/MMR- model, 
whereas TMZ (1a) only transiently suppressed 
tumor growth (Fig. 6, A and B). 

A focused maximum tolerated dose study 
revealed that KL-50 (4a) is well tolerated. 
Healthy mice were treated with escalating 
doses of KL-50 (4a) (0, 25, 50, 100, and 200 mg/ 
kg orally for 1 dose) and monitored over time 
for changes in both weights and hematologic 
profiles. Mice in the higher dosage groups (100 
or 200 mg/kg) experienced a greater than 10% 
weight loss after treatment, which regressed to 
baseline at the end of 1 week (Fig. 6C). Two of 
three mice in the 200 mg/kg treatment group 
became observably ill, warranting euthanasia, 
but no evidence of toxicity was observed in 
the remaining cohorts. Because the main 

Fig. 6. KL-50 (4a) is efficacious in an LN229 MGMT-/MMR- intracranial
model and is well tolerated with limited myelosuppression at supratherapeutic
doses. (A) Mean tumor size as measured with bioluminescent imaging as relative
light units (RLU; photons/s) with SEM of xenograft LN229 MGMT-/MMR- intracranial
tumors treated with 5 consecutive days of oral administration of 10% cyclodextrin
control (n = 10 mice), TMZ (1a) (n = 11 mice, 25 mg/kg), or KL-50 (4a) (n = 11 mice,
25 mg/kg) (individual spider plots are provided in fig. S10D). (B) Kaplan-Meier
analysis of intracranial xenograft tumor-bearing mice in (A), treated with 5
consecutive days of oral administration of 10% cyclodextrin control (n = 10 mice),
TMZ (1a) (n = 11 mice, 25 mg/kg), or KL-50 (4a) (n = 11 mice, 25 mg/kg)
demonstrating a significant survival benefit for KL-50 (4a) compared with both control
and TMZ (1a) groups. (C) Mean body weight change with SEM of mice during
maximum tolerated dose experiment in non-tumor-bearing mice. (D) Complete blood
counts for mice before treatment and 7 days after treatment with escalations of
single-dose KL-50 (4a) delivered orally. White blood cells (WBC) lower limit of normal
(LLN), 2.2 k/µL; neutrophil LLN, 0.42 k/µL; lymphocyte LLN, 1.7 k/µL; red blood
cells (RBC) LLN, 3.47 M/µL; platelet LLN, 155 k/µL. *P < 0.05; ****P < 0.0001; ns,
not significant.

dose-limiting systemic toxicity of TMZ (1a) is
myelosuppression, we measured complete blood
counts for all mice on day 0 before treatment
and subsequently on day 7 after drug admin- 
istration. Overall, there was a trend toward re- 
duced neutrophil and lymphocyte counts with 
KL-50 (4a) treatment, although average blood 
counts were within normal physiological ranges 
(defined as values falling within 2 SD of the 
average for healthy mice) for all cohorts (Fig. 
6D). Taken together, these data demonstrate 
in vivo efficacy, acceptable systemic tolerabil- 
ity, and central nervous system penetrance of 
KL-50 (4a). 


Discussion 


MMR mutation-induced resistance to alkylat- 
ing agents has been a major barrier to treat- 
ment efficacy since the introduction of TMZ 
(1a) in the early 1990s. Chloroethyl alkylating
agents, such as lomustine (14) and MTZ (12a), 
lack a TI owing to indiscriminate cross-linking 
activity through chloride displacement (32, 42). 
Fluoride is not normally susceptible to bimo- 
lecular displacement [the homolytic bond dis- 
sociation energies of aliphatic C-F and C-Cl 
bonds are ~115 kcal/mol (481 kJ/mol) and 
83.7 kcal/mol (350 kJ/mol), respectively]. How- 
ever, the appropriate positioning of hydrogen 
bond donors or covalently attached nucleo- 
philes can promote substitution (43). Thus, the 
intramolecular displacement pathway (5-6) 
appears to provide an essential acceleration 
in the formation of ICLs by KL-50 (4a) while 
maintaining sufficient lifetime for MGMT-mediated
repair of the O6FEtG (5) adduct.
This hypothesis is consistent with the reported
half-lives of O6-(2-fluoroethyl)guanosine (S1)
and O6-(2-chloroethyl)guanosine (S4) (t1/2 =
~18.5 hours and ~18 min) (fig. S1, A and B)
(19, 44) and the reported resistance of
N7-(2-fluoroethyl)guanosine (S5) to bimolecular
fluoride displacement (fig. S1C) (45). We hypothesize
that the rate of MGMT reversal
of O6FEtG (5) is comparable with the corresponding
ethyl and n-propyl adducts, which
are processed at 25 to 50% the rate of O6MeG
(3) in vitro (46). It has been reported that 
the mean lethal dose of ICLs in HeLa cells is 
230 (47), and TMZ (1a) has been demonstrated 
to yield 20,600 O6MeG (3) adducts per cell
at a dose of 20 µM (48). Assuming that a similar
level of O6FEtG (5) lesions are induced
by KL-50 (4a) at the IC50 (~20 µM) in MGMT-
/MMR- LN229 cells, the number of adducts 
required to convert to ICLs to generate the 
mean lethal dose is ~1 in 90, or ~1.1% cross- 
linking efficiency. 
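
The cross-linking efficiency estimate at the end of this paragraph can be reproduced with a short calculation. The sketch below uses only the two literature values quoted in the text (230 ICLs as the mean lethal dose, and 20,600 adducts per cell at 20 µM) and carries over the text's assumption that KL-50 (4a) produces a similar number of O6 adducts as TMZ (1a). The variable and function names are illustrative.

```python
MEAN_LETHAL_ICLS = 230      # mean lethal dose of ICLs per cell (HeLa), ref. 47
ADDUCTS_PER_CELL = 20_600   # O6MeG adducts per cell at 20 uM TMZ, ref. 48


def crosslink_efficiency(lethal_icls: int, adducts: int) -> float:
    """Fraction of adducts that must convert to ICLs to reach the lethal dose."""
    return lethal_icls / adducts


efficiency = crosslink_efficiency(MEAN_LETHAL_ICLS, ADDUCTS_PER_CELL)
print(f"efficiency = {efficiency:.2%}, about 1 in {1 / efficiency:.0f}")
# prints "efficiency = 1.12%, about 1 in 90"
```

The result agrees with the ~1 in 90 (~1.1%) figure stated above. It is an order-of-magnitude estimate, because the lethal dose was measured in HeLa cells and the adduct count was measured for TMZ (1a) rather than KL-50 (4a).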

Beyond MGMT-silenced recurrent glioma, it 
is possible that there may be additional bene- 
ficial indications for selective targeting of can- 
cer cells with KL-50 (4a). MGMT silencing has 
been reported in 40% of colorectal cancers and 
25% of non-small cell lung cancer, lymphoma, 
and head and neck cancers (49). MGMT mRNA
expression is also reduced in subsets of addi- 
tional cancer types, including breast carcinoma, 
bladder cancer, and leukemia (50). MMR loss, 
as reported by microsatellite instability, is a 
well-established phenomenon in multiple can- 
cer types and leads to resistance to various 
standard-of-care agents. 

Our data also suggest that KL-50 (4a) will 
display a higher TI in tumors with MGMT de- 
ficiency and impaired ICL repair, including HR 
deficiency. The TI of KL-50 (4a) in the DLD1 
isogenic model, as measured by the ratio of 
IC50 values in MGMT+/BRCA2+ cells compared
with MGMT-/BRCA2- cells, was ~600-fold, 
which is larger than canonical cross-linking 
agents cisplatin (42-fold) or MMC (26-fold). 
A similar amplification of the TI was seen in 
the PEO1/4 model with KL-50 (4a) (62-fold)
versus cisplatin (13-fold) or MMC (7-fold). HR- 
related gene mutations have been detected in 
a substantial number of tumors across multi- 
ple cancer types (17.4% in 21 cancer lineages), 
and methods have been developed to assess 
for tumor-associated HR deficiency (51).

Approximately half of patients with recur- 
rent MGMT-methylated glioma acquire TMZ 
(1a) resistance through loss of MMR (18, 52).
KL-50 (4a) is designed to fill this therapeutic 
void. We expect KL-50 (4a) to be amenable to 
derivatization for improved drug pharmaco- 
kinetic properties on the basis of prior work 
with the imidazotetrazine scaffold (53), facil- 
itating translation to clinical trials. Broadly, 
incorporating a consideration of the relative 
rates of DNA modification and DNA repair 
pathways into the design of therapeutics may 
lead to the development of additional selec- 
tive chemotherapies. 

Analysis of somatic mutations in 131 human brains
reveals aging-associated hypermutability 

We analyzed 131 human brains (44 neurotypical, 19 with Tourette syndrome, 9 with schizophrenia,
and 59 with autism) for somatic mutations after whole genome sequencing to a depth of more than
200x. Typically, brains had 20 to 60 detectable single-nucleotide mutations, but ~6% of brains 
harbored hundreds of somatic mutations. Hypermutability was associated with age and damaging 
mutations in genes implicated in cancers and, in some brains, reflected in vivo clonal expansions. 
Somatic duplications, likely arising during development, were found in ~5% of normal and diseased 
brains, reflecting background mutagenesis. Brains with autism were associated with mutations creating 
putative transcription factor binding motifs in enhancer-like regions in the developing brain. The 
top-ranked affected motifs corresponded to MEIS (myeloid ectopic viral integration site) transcription 
factors, suggesting a potential link between their involvement in gene regulation and autism. 


Somatic mutations naturally occur in
proliferative and postmitotic cells through- 
out human development and during 
aging, starting from the first cleavage 
of the zygote (1-4). It has yet to be
determined how frequent somatic mutations 
are in the population and whether they are a 
contributing factor to the etiology of neuropsychiatric
disorders (5). Mutations analyzed
in bulk tissues are typically present in >1% of 
cells, often arise during early development, 
and may be enriched for variants exerting a 
stronger phenotypic effect. We investigated 
somatic mosaicism in normal human brains 
(n = 44) and in brains of those affected by one 
of three neuropsychiatric diseases: Tourette 
syndrome (TS; n = 19), schizophrenia (SCZ; n =
9), or autism spectrum disorder (ASD; n = 59).


Results 
Mutation discovery and counts across cohorts 


For each frozen postmortem brain, one to two 
regions (cortex, striatum, or hippocampus) 
were extracted (~1 cm³) at one of three institutions:
Yale University, the Lieber Institute
for Brain Development (LIBD), or Harvard 
University [the data were reported previously 
(6)]. DNA extracted from bulk samples, from 
fluorescence-activated cell sorted neuronal 
and glial cell fractions, or from both was analyzed 
by whole genome sequencing (WGS) on an 
Illumina platform at various depths, resulting 
in a combined coverage of >200x per brain for
92% of the samples, and coverage as high as
620x (fig. S1 and table S1). The data were
uniformly processed—reads were aligned and 
somatic point mutations were called in all 
samples using a previously verified workflow 
(7). Somatic mutations were distinguished 


from inherited variations on the basis of both 
their frequency and the lack of overlap with 
the catalogs of known germline variations in 
the human population. 

In brains with bulk samples sequenced to at 
least 200x coverage (tier 1 set), we discovered 
20 to 60 somatic single-nucleotide mutations 
per brain with typical allele frequencies ranging 
from 1 to 10% across all cohorts (Fig. 1A, fig. S2B, 
and table S2). Additional mutations (included 
in the tier 2 set) were discovered when con- 
sidering additional sequencing coverage includ- 
ing data for cell fractions (see methods in the 
supplementary materials), but for an unbiased 
comparison across cohorts, we used the tier
1 set unless otherwise noted. In every cohort,
there was at least one brain with an outlier 
high count (i.e., above the upper outer fence 
of 101) of somatic mutations, which we refer to 
as a hypermutable brain (Fig. 1A and methods). 
The mutation calls in those brains had the same 
substitution spectrum as for other brains (Fig. 
1, B and C, and fig. S2E). From read-backed 
phasing of calls to personal DNA haplotypes, 
the proportions of supported (i.e., assigned to 
a single haplotype) and unsupported (i.e., with 
evidence of being at two haplotypes) calls in 
the hypermutable brains were the same as in 
other brains (Fig. 1D), indicating the same 
quality of calls across all brains. Thus, we 
concluded that calls in hypermutable brains 
represent bona fide somatic mutations. Fur- 
thermore, we conducted mutation validation 
in two hypermutable brains, NC7 and LIBD82. 
In brain NC7, at least 87.5% of mutation calls 
were validated using single-cell sequencing 
(Fig. 2). Similarly, we validated at least 80% of
mutations in the transcriptome of brain LIBD82
using RNA sequencing (methods).
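The outlier rule used above to flag hypermutable brains (counts above the upper outer fence) can be illustrated with a short sketch. It assumes the conventional Tukey definition of the upper outer fence, Q3 + 3 x IQR, and an inclusive quartile method; the study's exact quartile convention is not stated here, and the counts below are hypothetical, not data from the study.

```python
from statistics import quantiles


def upper_outer_fence(counts):
    """Return Q3 + 3 * IQR (Tukey's upper outer fence) for a list of counts."""
    # Quartile method is an assumption; other conventions shift the fence slightly.
    q1, _median, q3 = quantiles(counts, n=4, method="inclusive")
    return q3 + 3 * (q3 - q1)


# Hypothetical per-brain somatic mutation counts (illustrative only).
counts = [22, 25, 28, 31, 33, 35, 38, 41, 44, 47, 52, 58, 180, 420]

fence = upper_outer_fence(counts)
hypermutable = [c for c in counts if c > fence]

print(f"upper outer fence: {fence}")
print(f"counts flagged as outliers: {hypermutable}")
```

Because the fence depends on the quartiles of the whole set, adding or removing brains changes the threshold; the value of 101 quoted in the text is specific to the authors' dataset.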

When we excluded the hypermutable brains, 
there were no significant differences in somatic 
mutation burden between diseased and normal 
brains, the allele frequency of the somatic 
mutations, or the somatic mutation spectrum 
(Fig. 1, C and E). The combined mutation 
spectrum was dominated by C>T transitions 
in CpG motifs and matched the spectrum of 
mutations arising during development (Fig. 1, 
B and C, and fig. S2, E and G) (1, 7, 8). Brains
in each cohort had different age distributions, 
but the mutation burden in nonhypermutable 
brains did not correlate with age (fig. S2F), 
which is also consistent with somatic muta- 
tions being of early developmental origin. In 
contrast, the proportion of hypermutable 
brains rose with age (P = 8.2 × 10⁻³ by χ² test
for trend), suggesting that those brains have 


29 JULY 2022 * VOL 377ISSUE 6605 511 


RESEARCH | RESEARCH ARTICLES

[Fig. 1 graphic omitted. Legible labels: cohorts TS (N=15), ASD (N=59), SCZ (N=9), and Normal (N=37); mutation counts TS (1,046), ASD (2,249), SCZ (LIBD82, 1,756), SCZ (others, 560), and Normal (1,939); phasing categories Not supported, Supported, and Unphased.]


Fig. 1. Mutations discovered across cohorts of brains. (A) Summary of
coverage and detected somatic point mutations across cohorts using bulk brain
samples. Mutation burden across cohorts was comparable when excluding
hypermutable brains with >101 mutations (named in the plots). Different colors
indicate the institutions that processed the brain samples: Yale University,
blue; Harvard University, orange; and LIBD, red. Colored bars show the median
mutation count per disease cohort. The black bar represents the median mutation
count in all normal brains. (B) The combined mutation spectrum is dominated by
C>T transitions in CpG motifs and matches the spectrum of developmental
mutations (fig. S2). (C) The substitution spectra of detected mutations across
all brains are comparable. (D) Fractions of supported and unsupported calls by
assigning to haplotypes are comparable across brains, indicating the same
accuracy of calls for hypermutable (indicated by arrows) and nonhypermutable
brains. Owing to the short DNA fragments (~450 base pairs), only ~20% of calls
can be assigned to a haplotype using nearby heterozygous SNPs; other calls are
indicated as unphased. (E) When excluding hypermutable brains, mutation burden
does not correlate with age. However, there are more hypermutable brains (those
are above the dotted line) in older brains (blue histogram with error bars in
the background). Each data point is an individual brain with colors representing
phenotypes: TS, blue; ASD, orange; SCZ, red; and normal, gray. Circles indicate
males, crosses indicate females. (F) Distributions of mutation counts and allele
frequencies across cohorts. The numbers in parentheses are the mutation counts.
Brain LIBD82 is an exception; its mutations had higher frequencies than those in
other brains. The VAF value of the vertical dashed line corresponds to ~15% of
aneuploid cells in the hippocampus of that brain (which corresponds to displayed
average VAF of ~4% between the cortex and hippocampus). (G) Brain LIBD82 had
apparent aneuploidies in ~15% of cells in the hippocampus, which may be a
signature of incipient glioma or glioblastoma. DLPFC, dorsolateral prefrontal
cortex.

[Panel labels: VAF; copy number; chromosome; DLPFC; hippocampus; substitution classes T>G, T>C, T>A, C>T, C>G, C>A; all combined SNVs versus early in fetal brains (Bae et al.), cosine similarity = 0.907.]


mutations arising both in development and 
during aging (Fig. 1E). Specifically, hyper- 
mutable brains constituted 16% of brains 
over 60 years old, but only 2% of those under 
40 years old. One of the hypermutable SCZ 
brains with the highest mutation burden, 
LIBD82, had genomic aneuploidies in the 
hippocampus, including a duplication of chro- 
mosome 7 and a deletion of chromosome 10 
(Fig. 1, F and G), which is a signature of glio- 
blastoma (9). Given that clinical characteriza- 
tion of the brain did not provide any evidence 
of the disease, the presence of aneuploidies 
in this brain is consistent with the idea that 
driver mutations often precede cancer di- 
agnosis by years, if not decades (10). Thus, 
hypermutability in this brain may reflect 
incipient glioblastoma. 
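The statement that the combined mutation spectrum matched the developmental spectrum can be made concrete with cosine similarity, the measure labeled in Fig. 1B. The sketch below is a minimal illustration over the six substitution classes; the two spectra are hypothetical proportions chosen for the example, not values from the study, and the authors may have used a finer breakdown by trinucleotide context.

```python
import math

# The six substitution classes shown in the figure legend.
CLASSES = ["C>A", "C>G", "C>T", "T>A", "T>C", "T>G"]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length spectra."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical proportions per class (each list sums to 1).
spectrum_adult = [0.10, 0.06, 0.50, 0.07, 0.20, 0.07]
spectrum_fetal = [0.12, 0.05, 0.46, 0.08, 0.22, 0.07]

similarity = cosine_similarity(spectrum_adult, spectrum_fetal)
print(f"cosine similarity: {similarity:.3f}")
```

A value near 1 means the two spectra have nearly the same shape regardless of the total number of mutations in each set, which is why the measure suits comparisons between cohorts of different sizes.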
Origin of hypermutability

Hypermutable brains were found in patients 
and controls and in every cohort of samples 
generated by each lab. Thus, it is unlikely that 
the large mutation load in these brains is as- 
sociated with disease status or arises from 
cohort- or lab-specific conditions used for 
brain storage, handling, or sample processing. 
We hypothesize that the hypermutability arises 
from individual-specific biological causes. One 
possibility is that cells of such brains are in- 
trinsically hypermutable. However, the fre- 
quency spectrum of mutations in hypermutable 
brains as compared with nonhypermutable 
brains was depleted for higher frequency muta- 
tions [>6% variant allele frequency (VAF)], 
overrepresented for intermediate frequencies 
(3 to 6%), and almost the same for low-frequency
mutations (<3% VAF) (fig. S2, B to D).
These data suggest that intrinsic hypermu- 
tability, if it exists, is not innate (that is, present 
in all cell divisions throughout development) but 
rather occurs sometime during the lifetime of 
that individual. 

Out of six hypermutable brains, two, TS9 and NC7, had the same known
missense mutation (chr1:115,258,747 C>T) in the NRAS oncogene. This
mutation has been previously identified in multiple cancers and is deemed
to be pathogenic in the Catalogue of Somatic Mutations in Cancer (COSMIC)
database (ID: COSV54736383). Furthermore, three hypermutable brains (NC7,
TS9, and AN05983) carried 11 mutations predicted to have a damaging effect
in genes previously implicated in cancers (Table 1), such as MTOR, TET2,
DNMT3A, and IDH2. Overall, hypermutable brains were enriched for damaging
mutations in genes implicated in cancers (11, 12) as compared with
nonhypermutable brains (P = 2.4 × 10⁻³ by Fisher's exact test; fig. S3A).

Table 1. List of detected somatic mutations that putatively affect genes previously implicated in cancers and genes previously associated with
neuropsychiatric diseases. Some mutations are also annotated as pathogenic in the COSMIC database (33). A known mutation in NRAS oncogene is
present in two brains. Validation: A, by amplicon; R, previously reported and validated by Rodin et al. (6); and S, by single cells. Gene annotation: B, cancer
implicated genes in Bailey et al. (11); C, cancer gene census (12); ASD, SFARI genes; and SCZ, compiled from previous SCZ studies (34-37).

[Table 1 body only partly legible. Column headers: Mutation, Brain. Legible entries: chr15:90,631,934 C>T* (brain NC7, Normal); a truncated entry ending in 977,081 G>A; chr2:73,800,041 T>G* (brain LIBD100, SCZ).]

*Tier 2 mutation (see methods).

Given the above data, we hypothesized that the cell lineage carrying one
or more of these mutations underwent a clonal expansion, which allowed
other preceding lineage-specific mutations to rise to a detectable
frequency. In both brains TS9 and NC7, virtually all of the mutations were
present in both bulk cortex and striatum at an allele frequency of ~2%, as
well as in multiple other cell fractions at various frequencies (Fig. 2, A
and B, and fig. S3, B and C). This could be explained by clonal
hematopoiesis coupled with blood cell infiltration into the brain, such
that mutations from blood are present in different brain regions. Hence,
we investigated the expression of blood-specific genes (i.e., markers) in
the studied brains. Several markers consistently indicated that only brain
NC7 had a higher fraction of blood cells as compared with all other brains
(fig. S4). However, in NC7, all the mutations were present in sorted
interneurons from the striatum (STR-INT) at a much higher frequency (15 to
25% VAF), suggesting that the expanded lineage originated not in the blood
but rather in the embryonic basal ganglia, where interneurons are
generated, and then populated the cortex and striatum by cell migration.

[Table 1 fragment. Column headers: Phenotype, Hypermutable, VAF, Validation, (VEP). Section heading: In genes implicated in cancer. Legible values: Yes, 3.55%, S; n/a; No, 4.46%, n/a, Missense.]

We tested the hypothesis of lineage expan- 
sion (either of interneurons or of hematopoietic 
cells) by sequencing 16 single nuclei isolated 
from a different sample of the STR-INT fraction 
from brain NC7. Of all the somatic mutations 
discovered in bulk NC7 samples, 346 (94%) were 
genotyped in 8 of 16 nuclei, with 154 to 197 mu- 
tations genotyped in each nucleus (Fig. 2C). The 
genotyping rate, 42 to 53% across nuclei, was 
comparable to the genotyping sensitivity (34 to 
46%; see methods) allowing us to extrapolate 
that each of the eight nuclei contained almost 
all the mutations discovered in the bulk cortex 
and striatum and proving that the mutations 
belonged to the same cell lineage. 
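The extrapolation in the preceding paragraph rests on comparing the per-nucleus genotyping rate with the genotyping sensitivity. The following sketch works through that arithmetic using the counts quoted in the text; the total number of bulk mutations is back-calculated from "346 (94%)" and is therefore an approximation, and the helper names are hypothetical.

```python
def genotyping_rate(n_genotyped, n_total):
    """Fraction of bulk-discovered mutations observed in one nucleus."""
    return n_genotyped / n_total


def estimated_fraction_present(rate, sensitivity):
    """Fraction of mutations inferred to be present in a nucleus, capped at 1.

    If a present mutation is detected with probability `sensitivity`, then
    observing a fraction `rate` implies roughly rate / sensitivity are present.
    """
    return min(1.0, rate / sensitivity)


# 346 mutations were 94% of the bulk calls, so the total is about 368 (assumed).
n_total = round(346 / 0.94)

low_rate = genotyping_rate(154, n_total)   # nucleus with the fewest genotyped
high_rate = genotyping_rate(197, n_total)  # nucleus with the most genotyped

print(f"approximate total bulk mutations: {n_total}")
print(f"genotyping rate range: {low_rate:.0%} to {high_rate:.0%}")
print(f"fraction present at 46% sensitivity: "
      f"{estimated_fraction_present(low_rate, 0.46):.2f}")
```

When the observed rate is close to or above the sensitivity, the ratio approaches 1, which is the basis for inferring that each nucleus carried almost all of the bulk mutations.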

Unlike in brains NC7 and TS9, mutations in other hypermutable brains with
data available for multiple regions (LIBD82, LIBD96, LIBD98, and LIBD107)
were restricted to one brain region (fig. S3, D to F). For brain LIBD82,
the frequencies (~15% in the hippocampus) of point mutations and of
genomic aneuploidies were matching, consistent with both mutation types
being present in expanded lineages (Fig. 1, E and F). Altogether, the
above evidence suggests that hypermutability in some brains is caused by
lineage expansion, either during development or later in life.

[Table 1 fragment. Column headers: Consequence, Gene, Gene annotation, Genomic mutation ID (COSMIC), Prediction (COSMIC). Legible values: IDH2; B, C; COSV56389769.]
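The link between the fraction of cells carrying a mutation and its variant allele frequency (VAF), used above for brain LIBD82, can be sketched as follows. The example assumes a heterozygous mutation in a diploid region (one mutant allele out of two per carrying cell) and assumes the mutation is absent from the cortex; copy number changes such as the chromosome 7 duplication would alter the relationship and are ignored here.

```python
def vaf_from_cell_fraction(cell_fraction, mutant_copies=1, total_copies=2):
    """Expected VAF when `cell_fraction` of cells carry the mutation."""
    return cell_fraction * mutant_copies / total_copies


def cell_fraction_from_vaf(vaf, mutant_copies=1, total_copies=2):
    """Inverse of vaf_from_cell_fraction under the same assumptions."""
    return vaf * total_copies / mutant_copies


# ~15% of hippocampal cells carry the expanded lineage (value from the text).
vaf_hippocampus = vaf_from_cell_fraction(0.15)

# Assumed: the mutation is absent from cortex, so the two-region average halves.
vaf_cortex = 0.0
average_vaf = (vaf_hippocampus + vaf_cortex) / 2

print(f"hippocampus VAF: {vaf_hippocampus:.1%}")
print(f"average VAF across cortex and hippocampus: {average_vaf:.2%}")
```

Under these assumptions the average comes to roughly 4%, in line with the displayed average VAF noted in the Fig. 1 caption.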


Somatic mutations reveal uneven cell lineage 
distribution in the brain 


For each brain processed at Yale University (19 TS and 21 neurotypical
brains), we obtained sequencing data for bulk cortex, striatum, and up to
eight cell fractions per brain region (see methods). In both the cortex
and striatum, we successfully fractionated neurons and nonneuronal cells
(NeuN+/NeuN−); in the striatum, we also fractionated medium spiny neurons
(STR-MSN) and interneurons (STR-INT) (fig. S1B and methods). From each
brain processed at LIBD, we obtained sequencing data for bulk cortex and
hippocampus. The frequency of mosaic mutations in multiple regions and
fractions allowed us to explore the cell lineage distribution in the brain.

Fig. 2. Uneven cell lineage distribution of somatic mutations in brain.
(A) Mutation allele frequencies in samples from brain NC7. Almost all
mutations discovered in NC7 are present in the striatal interneuron fraction
(STR-INT) at high VAFs. CX, cortex. (B) Distribution of mutation allele
frequencies across samples in NC7. (C) Genotyping of mutations in

To investigate whether the above result 
(Fig. 2, A and B) could reflect a more general 
phenomenon, we tested for the difference in 
mutation VAFs between the cortex and stria- 
tum. If the early lineage distribution is the 
same across these two regions, there should 
be, on average, no difference in VAFs between 
them (methods). Out of 22 brains of the Yale 
cohort for which we had a complete dataset 
for the bulks and fractions, eight (six non- 
hypermutable) showed a statistically significant 
difference between the cortex and striatum 
(Fig. 2D and fig. S5, A and B). Except for the 
hypermutable brain NC7, a higher VAF of 
somatic mutations was always observed in the 
cortex. Even in NC7, the VAFs were higher in 
the cortex when excluding neuronal fractions 
(Fig. 2A and fig. S5C). Similarly, in 5 out of 
13 nonhypermutable brains processed by LIBD 
(six SCZ and seven neurotypical brains), we 
also observed a higher VAF in the cortex as 
compared with the hippocampus (Fig. 2D 
and fig. S5, D and E). Combining the brains 
with biased VAFs from the two datasets, we 
observed a significant overrepresentation of 
brains with higher VAF values in the cortex 
(P = 5 x 10 by binomial test). A possible 
explanation of this observation is that the 
founder population of the cortex is allocated 
with fewer earlier lineages, and so the fre- 
quency of each lineage is, on average, higher. 
Alternatively, the cortex may have a higher
propensity for expansion in some lineages when
compared with the striatum and hippocampus.

hippocampus (LIBD80).
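The overrepresentation of cortex-biased brains described above was assessed with a binomial test. A minimal sketch of such a sign test is given below, under the null hypothesis that a biased brain is equally likely to favor either region. The tally of 13 biased brains (8 from Yale plus 5 from LIBD) follows the text, but whether NC7 is counted toward the cortex and whether the test is one- or two-sided are assumptions, so the output is not meant to reproduce the reported P value.

```python
from math import comb


def binomial_upper_tail(k, n, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_biased = 13  # 8 Yale brains + 5 LIBD brains with significantly biased VAFs

# One-sided probabilities under two assumed tallies of cortex-biased brains.
p_all_cortex = binomial_upper_tail(13, n_biased)
p_all_but_one = binomial_upper_tail(12, n_biased)

print(f"13 of 13 toward cortex: P = {p_all_cortex:.2e}")
print(f"12 of 13 toward cortex: P = {p_all_but_one:.2e}")
```

Either tally gives a small probability under the null, which matches the qualitative conclusion that the bias toward the cortex is unlikely to arise by chance.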


Relevance of somatic mutations to genome function 


Given the mostly random distribution of somatic 
mutations in the human genome, the majority 
(~60%) of the mutations were outside genes 
with roughly the same distribution across the 
genome in each cohort (Fig. 3A and fig. S6A). 
Approximately 2 to 3% of mutations were in 
the coding part of the genome, and another 
6% had the potential to affect regulatory re- 
gions. Six putative deleterious mutations in 
genes previously associated with neuropsy- 
chiatric diseases were found in ASD and SCZ 
brains (Table 1). For example, in the ASD brain 
UMB4231, we detected and validated a missense 
mutation in PCDH15—a gene encoding a protein 
mediating calcium-dependent cell-cell adhe- 
sion, which was previously associated with 
ASD (13). Furthermore, in the ASD brain 
AN05983, we detected and validated a splice
mutation in MTOR, a gene that is part of the
insulin-like growth factor 1-phosphatidylinositol
3-kinase (IGF1-PI3K) signaling pathway impli-
cated in ASD (14).

We did not detect deleterious somatic mutations in TS-associated genes,
but a missense mutation in the ARHGEF6 gene (chrX:135,761,734 G>T) was
found in brain TS9 at a relatively high VAF in both the cortex (4.2% VAF)
and striatum (1.6% VAF). The ARHGEF6 gene has been associated with
X-linked intellectual disability, dendrite orientation, and cell polarity
(15). Indeed, knockout of ARHGEF6 in mouse models results in hippocampal
dendrite and/or synaptic abnormalities and deficits in learning (16).
Deficits in cell polarity have been implicated in Tourette syndrome from
analyses of rare de novo mutations (17).

[Fig. 2 axis labels: VAF(STR) − VAF(CX); VAF(Hippo) − VAF(DLPFC).]

16 single nuclei originated from the NC7 STR-INT fraction by sequencing
at 5x depth. Black bars represent genotyped mutations. (D) Examples of
brains where frequencies of mutations are significantly biased toward
the cortex as compared with the striatum (NC12) or as compared with the

We next investigated the possible functional 
impact of the discovered mutations in non- 
hypermutable brains on enhancer-like elements 
active in the fetal human brain (18) and in an
organoid model of the developing human 
brain (methods). For each set of enhancer-like 
elements, analysis of the ASD cohort revealed an 
excess, as compared with controls, of somatic 
mutations creating putative transcription factor 
(TF) binding motifs (P < 10* by binomial test) 
(Fig. 3B and fig. S6C). Top-ranked mutations 
in ASD brains created 16 putative binding 
motifs for the myeloid ectopic viral integra- 
tion site 1 (MEIS1), MEIS2, and MEIS3 tran- 
scription factors (Fig. 3C); in normal brains, 
only four such mutations were detected (not 
shown in Fig. 3C). Those TFs were also among 
the top-ranked when simulating random dis- 
tribution of mutations in ASD brains across 
the genome (not shown in Fig. 3C). 


Somatic structural mutations 


To call structural mutations, we used an approach implemented in
CNVpytor (Fig. 4). This tool segments the genome, considering both depth
of coverage and split in allele frequencies for single-nucleotide
polymorphisms (SNPs), allowing us to call deletions, duplications, and
copy number neutral losses of heterozygosity (CNN-LOH). The called
mutations can be either inherited or somatic, and we distinguished between
these two types by their cell frequency (methods). We excluded brain
LIBD82 for having aneuploidies and a few other brains because of poor
coverage and data quality. We focused on a total of nine high-confidence
calls encompassing an entire chromosome or for which we could resolve
breakpoints (tables S4 and S5 and figs. S7 to S21). For five mutations, we
attempted validation in brain tissue by polymerase chain reaction across
the breakpoints, which confirmed all of them (Fig. 4B and fig. S7). Two
more mutations were previously validated (19).

[...] cohort, mutations from nonhypermutable brains have roughly the same
distribution across genomes and similar predicted functional impacts by Variant
Effect Predictor (VEP). UTR, untranslated region. (B) Counts of putative TF
motifs disrupted by somatic mutations within enhancer-like elements from
organoids (upper plots) or fetal brain (lower plots), showing that mutations
significantly disrupt more putative TF binding sites in ASD versus controls. Each
circle represents the number of TFs with x (count of mutations in ASD) and y
coordinates (count of mutations in controls) of the circle. (C) Somatic mutations
in the ASD cohort predicted to lead to gain of binding sites for MEIS1, MEIS2, and
MEIS3 in enhancer-like elements. The consensus motif is on top. Mutations on
[...] coordinates listed in table S3. For each mutation, a reference base is shown with
a lowercase letter and the mutation base is noted by a capital letter above it.

Fig. 4. Detection and validation of large somatic structural mutations.
[...] duplication on chromosome 3 (chr3:113,067,261-113,233,476) [...]
matching to the same cell frequency (middle), and BAF for individual
SNPs (bottom). SNPs in the accessible genome (P-bases) are in blue,
other SNPs are in green. (B) Amplicon sequencing validation of the
duplication. Bar plot shows the number of reads mapping to the junction
of the duplication in the cortical region BA17 and caudate putamen of TS1
and in control sample. DUP, duplication; DEL, deletion; LOH, loss of
heterozygosity. (C) Summary of size and cell frequency of all structural
mutations detected from all analyzed individuals.

In total, we detected nine rearrangements 
in nine nonhypermutable brains (7% of all 
brains) ranging from 70 kilo-base pairs to 
16.5 mega-base pairs in length, with the largest 
mutation representing the loss of chromosome 
Y. That chromosome loss and another deletion 
were detected only in ASD brains. Seven somat- 
ic structural mutations were duplications, 
which were found in ~5% of both normal 
and diseased brains. The two duplications dis- 
covered in the Tourette cohort (brains TS1 and 
NC6, the only brains for which more than one 
region was available) were present in all bulk 
samples and fractions analyzed (Fig. 4A and 
fig. S7A). This finding is consistent with the 
idea that all these duplications occurred during 
development, which is also supported by their 
frequency in at least 10% of cells. Consistent 
with a developmental origin, sequence micro- 
homologies (1 to 4 base pairs) were found in 
most of the resolved breakpoints (fig. S22), 
suggesting that the corresponding mutations 
were generated by replicative mechanisms, as 
previously observed for mosaic copy number 
alterations in the fetal human brain (20). In 
many aspects, duplications resemble de novo
copy number variants (CNVs) in the CNV mutator
phenotype (21), in that they are mostly
tandem duplications of early developmental
origin and have sequence microhomologies
at breakpoints, although the mutations detected
in this study were smaller than those
reported in previous studies. Thus, we
hypothesize that somatic duplications represent
background mutations that arise in early de- 
velopment, with the CNV mutator phenotype 
representing the extreme manifestation of this 
phenomenon leading to a disease phenotype. 


Discussion 


Our study reveals a complex etiology of somatic 
mutations in the brain and their possible re- 
lationship with phenotype. While most of the 
studied brains had just a few dozen mutations 
of likely developmental origin in proliferating 
cells, ~6% of brains were classified as hyper- 
mutable. Hypermutability was not related to 
disease but did increase with age, reaching at 
least a 3% population frequency (95% confidence 
interval) for brains over 40 years old. The exact 
cause of hypermutability is yet to be determined, 
but besides a possible higher mutation rate, in 
some brains it originates from expansion of a cell 
lineage. Given the association of hypermutability 
with age and its frequent localization in one 
brain region, we hypothesize that it could be 
related to local clonal gliogenesis in adult brains, 
consistent with previous reports of increased 
glial fraction with age (22, 23). Such gliogenesis 
could be relevant to cognitive aging, such as
Parkinson’s and Alzheimer’s disease. Alterna- 
tively, hypermutability could be a remnant of a 
brain tumor escaped earlier in life. 

Half of the hypermutable brains studied in 
this work carried mutations in genes that are 
involved in clonal hematopoiesis in aging adults 
(24), consistent with previous observations from 
the analysis of a panel of genes (25). Thus, it is 
tempting to speculate that increases in clonal 
hematopoiesis with age (24), coupled with a 
leakier blood-brain barrier with age and extra- 
vasation of blood in brain tissue (26), could 
explain the higher mutation burden in those 
brains. Analysis of expression data in brain 
NC7 is consistent with an increased fraction 
of blood cells in that brain. However, analysis 
of copy number for loci of immunoglobulin 
receptors in B and T cells does not reveal an 
obvious presence of immune cells in the clonally 
expanded lineage (fig. S23). Furthermore, in that 
brain, all mutations were present in two regions 
and were enriched in postmitotic neuronal 
cell fractions, lending support to yet another 
possibility, that of a neural lineage clonal ex- 
pansion during development. In support of 
this scenario, a spectrum of de novo variants 
in DNMT3A, which overlaps with those found
in hematologic malignancies, causes Tatton- 
Brown-Rahman syndrome, a developmental 
disorder with macrocephaly (27). It was also 
recently shown that human interneurons, in 
contrast to other neurons, continue to ex- 
pand until birth (28). Nonetheless, establishing 
the cause(s) and extent of the phenomenon 
of hypermutability requires analysis of larger 
sets of brains and additional analyses at the 
single-cell level. 

Somatic structural mutations were detected 
in ~7% of brains, but because most of them were 
duplications (in ~5% of brains), they are unlikely 
to have functional consequences. The ASD co- 
hort was the only one where deletions were de- 
tected. Additional CNV calls (without resolved 
breakpoint but with confirmed haplotype im- 
balance) were also made only in the ASD cohort 
(table S4), perhaps pointing to the relevance 
of somatic deletions for the ASD phenotype. 
That said, further studies of larger ASD and 
other disease cohorts are necessary to make 
definitive conclusions on this observation. 

In ASD brains, somatic point mutations in 
noncoding regions create putative TF binding 
motifs within enhancer-like elements that are 
active in the developing human brain. We 
hypothesize that some of the newly created 
motifs result in binding of their corresponding 
TFs, thereby affecting TF protein dosage and 
dysregulating gene regulatory networks. The 
top-ranked affected motifs are putative binding 
sites for MEIS genes, which are homeodomain- 
containing transcriptional activators that pro- 
mote chromatin decompaction (29). MEIS TFs 
have been implicated in intellectual disabilities;
act as cofactors of HOX genes; and are regulators
of proliferation, growth, neurogenesis, and pat- 
terning during development (29). MEIS2 is 
highly expressed in cortical development and 
was defined as a marker of a subpopulation of 
cortical interneurons that populates the white 
matter (30). We previously proposed that dele- 
terious de novo ASD variants are enriched in 
homeodomain TF binding motifs within developmental
enhancer-like elements (31). Similarly,
enhancer-like elements associated with
ASD genes from the Simons Foundation
Autism Research Initiative (SFARI) collection
are enriched for homeobox binding motifs
(32). Therefore, we suggest that functional studies 
of mutations within homeodomain binding 
motifs during human development might aid 
efforts in understanding the etiology of ASD. 
Bioadhesive ultrasound for long-term continuous imaging of diverse organs

Chonghe Wang, Xiaoyu Chen, Liu Wang, Mitsutoshi Makihata, Hsiao-Chuan Liu, Tao Zhou, Xuanhe Zhao


Continuous imaging of internal organs over days could provide crucial information about health
and diseases and enable insights into developmental biology. We report a bioadhesive ultrasound (BAUS)
device that consists of a thin and rigid ultrasound probe robustly adhered to the skin via a couplant
made of a soft, tough, antidehydrating, and bioadhesive hydrogel-elastomer hybrid. The BAUS device
provides 48 hours of continuous imaging of diverse internal organs, including blood vessels, muscle,
heart, gastrointestinal tract, diaphragm, and lung. The BAUS device could enable diagnostic and
monitoring tools for various diseases.


Wearable devices that can continuously
monitor human physiology in a non- 
invasive manner represent a pivotal 
trend in precision and digital medicine (1).
Particularly, deep-tissue physiology,
including internal organ signals and
dynamics, contains crucial information regard- 
ing health and diseases (2). Although current 
wearable devices have successfully recorded 
physical and chemical signals from the skin 
such as electrocardiogram data (3-5) and sweat 
metabolites and electrolytes (6-8), clinical-grade 
imaging of various internal organs remains a 
central task and a challenge in the field of 
wearable devices. Ultrasound imaging with 
zero radiation allows clinicians to evaluate 
tissue and organ functions and diagnose various 
diseases (9). If sampled frequently over days 
to months, ultrasound imaging of internal 
organs could help clinicians monitor health 
status, observe disease progression, and assess 
disease risks (10-12); it could also lead to dis- 
coveries in and a better understanding of 
developmental biology (13).

Wearable ultrasound devices have the po- 
tential for continuous imaging of internal or- 
gans. Conventional wearable ultrasound imaging 
usually relies on mounting bulky ultrasound 
probes on the skin by either robotic systems 
(14, 15) or mechanical fixtures such as straps 
and tapes (16-18), which hamper patients’ 
mobility and wearing convenience and comfort. 
In addition, conventional wearable ultrasound 
imaging is only applicable to limited body parts 
such as muscles because of the thick ultrasound 
probe and the high pressure exerted by the
probe (16). For example, mechanically moun- 
ting a conventional ultrasound probe on the 
neck to image the carotid artery, jugular vein, 
and vagus nerve could cause suffocation. 

Although the stretchable ultrasound imag- 
ing device has improved wearability (12), it 
still suffers from limitations, including low 
imaging resolution, unstable imaging quality 
during body motions (fig. S1), a short continuous 
imaging duration (1 hour), and susceptibility to 
device failure (table S1). (The continuous imag- 
ing duration is defined as the longest time that 
a device can be adhered or worn on the skin.) 
These limitations mainly stem from the design 
of the device, which consists of piezoelectric 
elements over a stretchable substrate (Fig. 1A). 
Although the stretchable substrate can deform 
conformally with the skin, it limits the density 
of elements (156 per square centimeter) and 
is incompatible with backing and matching 
layers, leading to a low imaging resolution 
(12). In addition, when the substrate deforms
with the skin under body motions, the ele- 
ments’ spatial and angular positions vary 
unpredictably, which hampers the imaging 
stability of the device (fig. S1). Furthermore, 
existing wearable ultrasound devices, in both 
rigid and stretchable form factors, mostly rely 
on hydrogel or elastomer couplants for acoustic 
transmission to the skin (19-22). However, the 
hydrogel couplants usually get dehydrated 
or detached from the skin over a few hours 
(19, 20), and the elastomer couplants are too 
damping to image deep organs (21, 22) (fig.
S2 and table S2).

We report a bioadhesive ultrasound (BAUS) 
device that consists of a thin and rigid ultra- 
sound probe robustly adhered on the skin via 
a couplant layer made of a soft, tough, anti- 
dehydrating, and bioadhesive hydrogel-elastomer 
hybrid (Fig. 1, B to D). The BAUS probe enables a 
high density of elements (400 elements per 
Fig. 1. Design and imaging performances of the BAUS device. (A) Existing epidermal wearable 
ultrasound imaging usually relies on thin and stretchable devices physically attached to the skin. (B) The 
BAUS device consists of a thin and rigid ultrasound probe robustly adhered to the skin via a couplant 
made of a soft, tough, antidehydrating, and bioadhesive hydrogel-elastomer hybrid. The dashed 
rectangle indicates the region shown on the right in (C). (C) The BAUS couplant consists of a tough 
hydrogel, containing 90 wt % water, that is encapsulated by an elastomer membrane further coated by a 
thin bioadhesive layer. The BAUS couplant maintains robust adhesion between the BAUS probe and 
the skin over 48 hours and insulates the BAUS probe from skin deformation during dynamic body 
motions. (D) Scanning electron microscopy (SEM) image of the cross section of the BAUS couplant. The 
total thickness of the elastomer membrane and the bioadhesive layer is less than one-fourth of the 
ultrasound wavelength λ. Scale bar is 10 μm. (E) Schematics of the BAUS probe structure. The BAUS 
probe consists of an array of high-performance piezoelectric elements, which are controlled by the top 
and bottom circuits. The top and bottom circuits are covered by the acoustic backing and matching 
layers, respectively. The dashed rectangle indicates the region shown in (F). (F) Optical microscopic 
image of a BAUS probe with 3D-printed top and bottom circuits. The inset image provides a zoomed-in 
top view of the device showing the top circuit and piezoelectric elements. Scale bar is 0.5 mm. 
(G) Comparison of axial-imaging resolutions and continuous-imaging durations of existing wearable 
ultrasound imaging devices and the BAUS device. 
square centimeter; fig. S3), stable element positions 
under dynamic body motions, and high 
reliability of the probe in long-term applica- 
tions. The BAUS couplant effectively transmits 
acoustic waves, insulates the BAUS probe from 
skin deformation, and maintains robust and 
comfortable adhesion on the skin over 48 hours. 

The BAUS probe consists of an array of 
high-performance piezoelectric elements with 
a center frequency of 3, 7, or 10 MHz (Fig. 1, E 
and F; fig. S3; and table S3). Each element is 
controlled by the top and bottom circuits of 
the BAUS probe. The circuit can be fabricated 
with three techniques: three-dimensional (3D) 
printing (Fig. 1F, fig. S4, and movie S2) (23), 
laser etching (24), and photolithography (fig. 
S5) (25), which give circuit-line resolutions 
of 100, 10, and 1 μm, respectively. The bottom 
circuit is covered by an optimized acoustic 
matching layer to enhance the acoustic trans- 
mission to the skin, and the top circuit is covered 
by an optimized acoustic backing layer to 
quench the resonance effect (fig. S6; see mate- 
rials and methods for details on fabrication 
and optimization of the matching and backing 
layers). The BAUS probe is sealed by a layer of 
epoxy for high stability and reliability in long- 
term applications. We further designed a plug- 
and-play input-output to connect with the 
flexible flat cable (fig. S7). The BAUS probe 
has a thickness of 3 mm, length and width of 
1 to 2 cm, and weight of 10 to 40 g (fig. S3 
and table S3). 

The BAUS couplant consists of a soft yet tough 
hydrogel composed of chitosan-polyacrylamide 
interpenetrating polymer networks (10 wt %) 
and water (90 wt %) (see materials and 
methods for details on fabrication of the BAUS 
couplant). The hydrogel is encapsulated by a 
thin elastomer membrane (thickness of <40 μm) 
of polyurethane to prevent dehydration of 
the hydrogel and to provide comfortable skin 
contact with a dry couplant surface (figs. 
S8 and S9). The polyurethane membrane is 
grafted with poly(acrylic acid) coupled with 
N-hydroxysuccinimide ester (NHS ester) to 
form robust bonding with the hydrogel (26). 
The hydrogel-elastomer hybrid is further coated 
by a thin bioadhesive layer (thickness of <10 μm) 
synthesized by copolymerizing poly(ethylene 
glycol) diacrylate, 2-ethylhexyl acrylate, acrylic 
acid, and acrylic acid-NHS ester (fig. S10). The 
carboxylic acid, ethyl, and hexyl groups in the 
bioadhesive can form physical bonds such as 
hydrogen bonds and electrostatic interactions 
with the skin; the NHS ester groups in the 
bioadhesive can form covalent amide bonds 
with the skin (26). The total thickness of the 
elastomer membrane and the bioadhesive 
layer is less than one-fourth of the acoustic 
wavelength to minimize the acoustic attenua- 
tion of the BAUS couplant (Fig. 2D). 

To quantitatively validate that the BAUS 
device can robustly adhere to the skin under 
Fig. 2. Adhesion and acoustic performances of the BAUS device. (A) Schematics of the mechanical 
model for a rigid device adhered on the skin via a couplant layer. E_c, couplant Young's modulus; E_s, skin 
Young's modulus; H, couplant thickness; L, device length. (B) The energy release rate G as a function of the 
couplant thickness H calculated by the model with typical parameters: skin and couplant Young's modulus E_s = E_c = 
100 kPa, device length L = 2 cm, applied strain on the skin ε_app = 20%, and interfacial toughness between the 
couplant and the skin Γ. (C) Interfacial toughness between the BAUS couplant and porcine skin measured 
over 48 hours after adhesion formation. Error bars indicate standard deviation (±SD) of interfacial toughness 
(n = 4 independent samples). (D) Acoustic attenuation coefficients of the BAUS couplant at frequencies from 1 to 
10 MHz over 48 hours in air. (E) The BAUS device adhered on the skin can withstand high pulling forces and maintain 
robust adhesion on the skin over 48 hours. Scale bar is 10 mm. 


dynamic body motions, we developed a model 
of a rigid device adhered to the skin via a soft 
couplant layer (Fig. 2A; see fig. S11 and mate- 
rials and methods for details on the model). 
When the skin is stretched under an applied 
strain ε_app, a crack may initiate at the edge of 
the couplant-skin interface and propagate. 
Crack propagation is driven by the energy 
release rate G (defined as the reduction of 
stored elastic energy in the couplant-skin 
system when the crack propagates by a unit 
area) and resisted by the interfacial toughness 
Γ (defined as the energy required to propagate 
the crack by a unit area). In Fig. 2B, we 
calculate the energy release rate G of a 
couplant-skin system under the plane-strain condition 
with a typical device length L of 2 cm and a 
typical applied strain on the skin ε_app of 20%. 
For simplicity of calculation, the skin and the 
couplant layer are taken as incompressible 
linear elastic materials with the same Young’s 
modulus of 100 kPa. If the energy release rate 
is higher than the interfacial toughness, G > Γ 
(unshaded zone in Fig. 2B), the crack propagates 
on the interface and the device debonds 
from the skin. When the rigid device is phy- 
sically attached on the skin without any cou- 
plant (couplant thickness H = 0), the interfacial 
toughness is ~1 J/m², and the rigid device 
detaches from the skin. By contrast, when the 
rigid device is adhered to the skin via the 
BAUS couplant, the interfacial toughness exceeds 
500 J/m² both in air and under water 
(Fig. 2C), robustly bonding the device on the 
skin for any couplant thickness. In fig. S11, 
we calculate the energy release rate of the 
couplant-skin system with various parameters 
for the future design of bioadhesive devices. 

To evaluate the adhesion performance of 
the BAUS couplant on the skin, we conducted 
the standard 90°-peeling test (ASTM D2861) 
to measure the interfacial toughness of the 
elastomer membrane adhered on porcine skin 
via the bioadhesive layer (see materials and 
methods for details on the test). We choose 
porcine skin as the model tissue for evaluating 
adhesion performance because of its resem- 
blance to human skin (26). The elastomer 
membrane can establish tough adhesion on 
porcine skin upon contact, providing an interfacial 
toughness of 853 J/m². When the 
adhered sample is stored in air (relative 
humidity of 30 to 50%; temperature of 24°C), 
the interfacial toughness decreases to 756 J/m² 
over 48 hours. When the adhered sample is 
stored in water to simulate wet environments, 
the interfacial toughness decreases to 518 J/m² 
over 48 hours. These results (Fig. 2, C and E) 
show that the bioadhesive layer can robustly 
adhere the BAUS device on the skin over 48 hours 
in dry and wet environments. The BAUS device 
can be detached from the skin by peeling from 
one edge of the BAUS couplant without leav- 
ing any residue (fig. S12). In comparison, the 
interfacial toughness of typical liquid (Aqua- 
sonic Clear, Parker Labs) and solid (Aquaflex, 
Parker Labs) hydrogel couplants and elastomer 
couplants, including silicone (Ecoflex, Smooth-On), 
polyurethane (Sigma-Aldrich), and Aqualene 
(Olympus), are all less than 10 J/m², which 
cannot give stable adhesion of ultrasound 
probes on the skin. Although the 3M VHB 
acrylic tape can maintain a relatively high 
interfacial toughness (>456 J/m²) in air over 
48 hours, the interfacial toughness reduces to 
27 J/m² in water, possibly leading to unstable 
adhesion in wet environments. 

To evaluate the wearing comfort of the 
BAUS couplant on the skin, we adhered a set of 
BAUS couplants on various body locations— 
including arm, neck, chest, and waist—of 15 
subjects for 48 hours (fig. S9). Most of the 
subjects reported no positive reaction (i.e., 
itchiness, irritation, dryness, or redness) at any 
Fig. 3. Long-term continuous BAUS imaging of blood vessels. See also fig. S30 and movie S5. 
(A) Schematics of the BAUS device adhered on the neck of the subject to image the carotid 
artery (CA) and jugular vein (JV). (B) The BAUS device can stably image the blood vessels under 
dynamic body motions such as neck rotation with angles up to ±30°. (C) The diameter of the 
jugular vein increases substantially from the sitting or standing position to the supine position. 
(D) Color-flow imaging of the carotid artery by the BAUS device. The diameter of the carotid artery 
and the blood flow rate in the carotid artery increase substantially after 0.5 hours of physical 
exercise. Fw, forward; Bw, backward. (E) Blood pressure waveforms of the carotid artery 
before and after 0.5 hours of physical exercise. The systolic blood pressure of the carotid 
artery increases substantially after 0.5 hours of physical exercise. The decrease in the blood pressure 
from the systolic peak in each cardiac cycle is much steeper after 0.5 hours of physical exercise. 
(F) The systolic blood pressure of the carotid artery over 48 hours as measured from the 
BAUS imaging. The systolic blood pressure is measured by calculating the mean and standard 
deviation of 15 consecutive systolic peaks every 0.5 hours. Error bars indicate standard 
deviation (SD) of measured systolic pressure (n = 15 independent samples). The 0.5-hour training 
periods are indicated by the dashed lines. 


location; only one subject reported minor it- 
chiness after 48 hours of continuous wearing 
(fig. S13). As control samples, the abovemen- 
tioned typical couplants were adhered (3M VHB 
acrylic tape) or mechanically wrapped (others) 
on the arms of the 15 subjects. The liquid and 
solid hydrogel couplants received the highest 
positive reactions, and the positive reactions of 
the elastomer couplants were also higher than 
that of the BAUS couplant. These results are 
consistent with the comfort level of the sub- 
jects as evaluated by the visual analog scale 
(VAS): the BAUS couplant is most comfortable, 
and the liquid and solid hydrogel couplants are 
least comfortable (fig. S13). 

To evaluate the acoustic performance of the 
BAUS couplant, we used the transmission- 
through method to measure the attenuation 
coefficients of the BAUS couplant. At frequen- 
cies from 1 to 10 MHz, the attenuation coefficients 
of the BAUS couplant were measured to be 
from 0.036 to 0.131 dB/mm (Fig. 2D and fig. 
S14). We further measured the acoustic im- 
pedance of the BAUS couplant to be 1.59 MRayl, 
which matches the acoustic impedance of the 
skin. In comparison, the typical liquid and solid 
hydrogel couplants have attenuation 
coefficients similar to those of the BAUS couplant 
at frequencies from 1 to 10 MHz: 0.070 to 
0.120 dB/mm and 0.117 to 0.174 dB/mm, respectively. 
However, the attenuation coefficients 
of all abovementioned elastomer couplants are 
multiple times higher than that of the BAUS 
couplant at the same frequency. This is con- 
sistent with the result that these elastomer 
couplants are too damping to allow wearable 
ultrasound imaging of internal organs (fig. S2). 
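
As a rough illustration of what an attenuation coefficient in dB/mm implies, the one-way loss through a couplant layer is the coefficient multiplied by the layer thickness, and a loss in decibels converts to a pressure-amplitude ratio as 10^(-dB/20). The sketch below applies that conversion to the two ends of the range reported for the BAUS couplant. The 5-mm layer thickness and the function names are illustrative assumptions, not values or methods taken from this work.

```python
def one_way_loss_db(alpha_db_per_mm: float, thickness_mm: float) -> float:
    """Attenuation (dB) accumulated over a single pass through the layer."""
    return alpha_db_per_mm * thickness_mm


def amplitude_fraction(loss_db: float) -> float:
    """Fraction of pressure amplitude remaining after a loss given in dB."""
    return 10 ** (-loss_db / 20.0)


# Reported BAUS couplant range over 1 to 10 MHz: 0.036 to 0.131 dB/mm.
# ASSUMPTION: a 5-mm-thick layer, chosen only for illustration.
THICKNESS_MM = 5.0

for alpha in (0.036, 0.131):
    loss = one_way_loss_db(alpha, THICKNESS_MM)
    frac = amplitude_fraction(loss)
    print(f"{alpha:.3f} dB/mm over {THICKNESS_MM} mm: {loss:.3f} dB, amplitude fraction {frac:.3f}")
```

A pulse-echo measurement crosses the layer twice, so the round-trip loss in dB would be double the one-way figure under the same assumptions.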
To evaluate the performances of the BAUS 
probes, we first performed electrical imped- 
ance measurements on all elements in the 
probes (fig. S15 and table S4). The measured 
center frequency of the elements on each BAUS 
probe is indeed 3, 7, or 10 MHz (with a minor 
deviation of <5%) as prescribed; the elec- 
trical impedances of the probes at their center 
Fig. 4. Long-term continuous BAUS imaging of lung, diaphragm, heart, and stomach. 
See also figs. S33 to S36. (A) Schematics of the BAUS device adhered to the chest 
of the subject to image the lung. (B) The BAUS imaging shows a smooth pleural line and 
repetitive A-lines of the lung. (C) The BAUS imaging (right) shows clear diaphragm 
morphology. A diagram of the placement of the BAUS device is shown on the left. 
(D) The diaphragm motility increases substantially in amplitude and frequency after 
0.5 hours of physical exercise. (E) The BAUS imaging shows the dynamics of the four 
chambers of the heart. The size of the left ventricle greatly increases after 0.5 hours of 
physical exercise. (F) The BAUS imaging shows the dynamics of the stomach. The gastric 
antral cross-sectional area gradually decreases after the subject drinks 450 ml of juice. 
frequencies are 49, 36, and 43 ohms, respectively 
(fig. S16 and table S4). We then measured the 
cross-talk in each BAUS probe to be less than 
-40 dB, which is sufficiently low for high- 
quality imaging applications (fig. S17 and table 
S4) (27). We next performed the pulse-echo 
tests on all BAUS probes to measure their 
bandwidths. The measured bandwidths of the 
3-, 7-, and 10-MHz BAUS probes are 68, 75, and 
78%, respectively (fig. S18 and table S4). 

To evaluate the imaging resolutions of the 
BAUS probes, we first performed the pulse- 
echo tests on the BAUS probes to characterize 
their axial resolutions (fig. S19). The axial resolutions 
of the 3-, 7-, and 10-MHz BAUS probes 
are 0.77, 0.225, and 0.1924 mm, respectively 
(fig. S19 and table S4). We then measured the 
point spread functions of the BAUS probes 
to characterize their lateral resolutions. The 
lateral resolutions of the 3-, 7-, and 10-MHz 
BAUS probes are 1.79, 0.38, and 0.38 mm at 
their imaging focal depths of 6, 3, and 2 cm, 
respectively (figs. S20 to S22 and table S4). 
These measured values are also confirmed 
by the acoustic simulations (figs. S20 to S22). 
In comparison, the axial resolution of the 
stretchable ultrasound imaging probe (2 MHz) 
is reported to be 5.775 mm (22). 

To test the imaging stability of the BAUS 
device, we adhered the BAUS device on a 
phantom of a blood vessel in soft tissues under 
various frequencies of vibration (0 to 3.5 Hz 
with an amplitude of 2 cm) to evaluate its 
acoustic performance under dynamic body 
motions. Because of stable adhesion of the 
BAUS device to the phantom and the relatively 
high imaging frame rate (40 Hz), the motion 
artifact on imaging is minimal (fig. S23). 

To test the thermal properties of the BAUS 
device, we measured the temperature of the 
BAUS device when continuously actuated 
by various levels of working voltages (0 to 
90 V) over 48 hours. Owing to its low working 
power and thin form factor for heat dissipa- 
tion, the BAUS device can maintain a constant 
level of temperature (24°C) while continu- 
ously imaging for 48 hours (figs. S24 and S25), 
validating its thermal stability in long-term 
applications. 

We demonstrate the BAUS imaging plat- 
form’s capability to perform 48-hour continuous 
imaging of human organs, including blood ves- 
sels and heart, muscle and diaphragm, stom- 
ach, and lung. These organs represent the 
cardiovascular, muscular, digestive, and respi- 
ratory systems, respectively. To image organs 
with depths less than 6 cm beneath the skin 
(i.e., jugular vein, carotid artery, and bicep 
muscle), we chose an ultrasound probe with 
a center frequency of 7 or 10 MHz and adopted 
the plane wave compounding method (fig. S26). 
To image organs deeper than 6 cm beneath the 
skin (i.e., heart, stomach, diaphragm, and lung), 
we chose an ultrasound probe with a center 
frequency of 3 MHz and adopted the phased 
array imaging method (figs. S27 to S29 and 
movies S3 and S4; see materials and methods 
for details on acoustic simulation). The probes 
are adhered to various locations on the skin 
of a healthy subject via the BAUS couplants 
for continuous imaging of the corresponding 
organs over 48 hours (see materials and methods 
for details on data acquisition and image 
processing). 

Figure 3 and fig. S30 demonstrate the 
imaging results of the jugular vein and carotid 
artery obtained by the 10-MHz BAUS device 
adhered to the right-hand side of the neck 
(Fig. 3A). Dynamic body motions such as neck 
rotation can cause blood vessel shifting and 
skin deformation, which deteriorate the imag- 
ing quality and stability of existing wearable 
ultrasound devices (17). By contrast, the BAUS 
device can provide 48-hour continuous imag- 
ing of the jugular vein and carotid artery 
under dynamic body motions such as neck 
rotation with angles up to ±30° (Fig. 3B and 
movie S5). This is because of the robust ad- 
hesion of the BAUS device on the neck, the 
high imaging resolution of the BAUS probe, 
and the large acoustic window given by the 
BAUS probe aperture (2 cm by 1 cm). The 
BAUS imaging results show that the diameter 
of the jugular vein in the subject increases 
from the sitting or standing position to the 
supine position (Fig. 3C and fig. S30). The 
change of the jugular vein diameter strongly 
correlates with the right atrium pressure (28) 
and thus can be used to diagnose cardiac 
diseases such as heart failure and pulmonary 
hypertension (29). The BAUS imaging results 
further provide 48-hour continuous blood 
flow rate and blood pressure waveform data 
of the carotid artery (Fig. 3, D to F, and fig. 
S30). For example, we observed dramatic in- 
creases in blood flow rate (from 65 to 117 cm/s; 
Fig. 3D) and systolic blood pressure (from 115 
to 168 mmHg; Fig. 3E) of the carotid artery in 
the subject before and after 0.5 hours of physical 
exercise (see materials and methods for details 
on data acquisition and calculation of the blood 
pressure). Moreover, from the blood pressure 
waveform measured after the 0.5-hour jogging 
session (right panel of Fig. 3E), we can see a 
steeper decrease in the blood pressure from 
the systolic peak in each cardiac cycle than before 
the exercise, possibly because of the exercise- 
induced vasodilation of the carotid artery. 

Figure S31 demonstrates the imaging re- 
sults of the bicep muscle obtained by the 7-MHz 
BAUS device adhered to the upper arm of the 
subject (fig. S31). The BAUS system can provide 
48-hour continuous imaging of the bicipital 
muscle. For example, we observed increased 
blood perfusion and echogenicity on the muscle 
and tendon interface 2 hours after a 1-hour 
weightlifting training session and increased 
vascularity 10 hours after the training (fig. S32). 

Figure 4, A and B, and fig. S33 demonstrate 
the imaging results of the lung obtained by the 
3-MHz BAUS phased-array device adhered 
on the right chest (longitudinal direction) of 
the subject (Fig. 4A). The BAUS system can 
provide 48-hour continuous imaging of key 
characteristics of the lung, including the 
pleural line and A-lines. The BAUS device is 
also able to continuously and stably image the 
lung under body motions such as jogging and 
cycling (fig. S34). The BAUS imaging shows a 
smooth pleural line and repetitive A-lines of 
the subject over 48 hours, indicating a healthy, 
normally aerated lung. Although lung ultraso- 
nography has proven to be an effective tool 
for diagnosing COVID-19 patients (30), it is 
mostly available in hospitals for infrequent 
and noncontinuous imaging of the lung. By 
contrast, the wearable BAUS system could 
continuously monitor the symptoms of possi- 
bly infected COVID-19 patients at home. 

Figure 4, C and D, demonstrates the imaging 
results of the diaphragm motility obtained by 
the 3-MHz BAUS phased-array device adhered 
on the right anterior subcostal region of the 
subject before and after 0.5 hours of physical 
exercise. The BAUS imaging shows that the 
diaphragm motility after training is more dra- 
matic in amplitude and frequency compared 
with quiet breathing before training. 


Figure 4E and fig. S35 demonstrate the 
imaging results of the heart obtained by the 
3-MHz BAUS phased-array device adhered 
on the left chest of the subject (as visualized 
through the apical four-chamber view). The 
BAUS system can provide 48-hour continuous 
imaging of the dynamics of the four cardiac 
chambers: the right ventricle, right atrium, left 
ventricle, and left atrium. The BAUS device is 
also able to continuously and stably image the 
heart under body motions such as jogging and 
cycling (fig. S34). For example, we observed that 
the size of the left ventricle greatly increases 
after 0.5 hours of physical exercise. The contin- 
uous images of the cardiac chambers (fig. S35) 
can be further processed to calculate the dy- 
namics of cardiac strain, which is an important 
parameter that could indicate cardiomyopathy. 

Figure 4F and fig. S36 demonstrate the 
imaging results of the stomach obtained by 
the 3-MHz BAUS phased-array device adhered 
on the upper abdomen skin of the subject. The 
BAUS imaging system provides 48-hour con- 
tinuous imaging of the gastric antral cross section. 
For example, we observed an extended gastric 
antrum once the subject drank 450 ml of juice 
(Fig. 4F). Over the next 2 hours, the gastric 
antral cross-sectional area gradually decreased 
because of gastric emptying (Fig. 4F and fig. S36). 

Although the imaging resolutions of the 
BAUS probes are superior to those of existing 
wearable ultrasound devices (Fig. 1G and table 
S1), the BAUS probes can be further improved 
for better imaging quality in the future. The 
pitch of the current BAUS probes is relatively 
large (0.5 mm; table S3) and can cause grating 
lobes, especially in the 7- and 10-MHz BAUS 
probes. Although the grating-lobe artifacts can 
be alleviated by using the plane wave compounding 
method (fig. S26) (31), the pitches 
of future BAUS probes will be reduced to 
eliminate the artifacts. Although the current 
BAUS probes do not have elevation focusing, 
thin-profile (<1 mm) elevation lenses (32) can 
be added to future BAUS probes for elevation 
focusing to achieve better imaging quality, es- 
pecially for deep organs. 
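
The link between pitch and grating lobes can be made concrete with the standard relation for a uniform linear array, sin(θ_g) = sin(θ_steer) ± λ/d, where d is the element pitch and λ the wavelength. The following is a minimal sketch of that textbook relation, not the authors' simulation. The sound speed of 1540 m/s is an assumed typical soft-tissue value, and the function names are hypothetical.

```python
import math

SOUND_SPEED_M_S = 1540.0  # ASSUMPTION: typical soft-tissue value, not from this work


def wavelength_mm(freq_mhz: float, c_m_s: float = SOUND_SPEED_M_S) -> float:
    """Acoustic wavelength in millimetres for a frequency in MHz."""
    return c_m_s / (freq_mhz * 1e6) * 1e3


def first_grating_lobes_deg(freq_mhz: float, pitch_mm: float, steer_deg: float = 0.0) -> list:
    """Angles (degrees) of the first-order grating lobes of a uniform linear array.

    Uses sin(theta_g) = sin(theta_steer) +/- wavelength / pitch and keeps
    only solutions that exist in real space (|sin| <= 1).
    """
    ratio = wavelength_mm(freq_mhz) / pitch_mm
    s0 = math.sin(math.radians(steer_deg))
    lobes = []
    for order in (-1, 1):
        s = s0 + order * ratio
        if abs(s) <= 1.0:
            lobes.append(math.degrees(math.asin(s)))
    return sorted(lobes)


PITCH_MM = 0.5  # pitch quoted in the text (table S3)

for f_mhz in (3, 7, 10):
    angles = [round(a, 1) for a in first_grating_lobes_deg(f_mhz, PITCH_MM)]
    print(f"{f_mhz} MHz, unsteered: first-order grating lobes at {angles} degrees")
```

Under these assumptions, the unsteered 3-MHz array has no first-order grating lobe in real space because its wavelength slightly exceeds the 0.5-mm pitch, whereas the 7- and 10-MHz arrays do, which is consistent with the concern being greatest for the higher-frequency probes.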

Because the current work is focused on 
designing and fabricating the BAUS devices, 
we use an external system (Verasonics Vantage 
system) for data acquisition. Despite the large 
size of the data acquisition system, the BAUS 
devices may find immediate applications in 
nonmobile imaging in clinical settings such 
as long-term imaging of patients in intensive- 
care units. In addition, because the data acquisi- 
tion systems of point-of-care ultrasound devices 
(e.g., GE VScan, Butterfly IQ, and Philips 
Lumify) have been miniaturized to the size 
of a cell phone, the data acquisition system 
of BAUS can be miniaturized in the future. 
Beyond clinical applications, the BAUS imaging 
platform could provide long-term continuous 
imaging of the developments of tissues and 
organs, including embryo, tumors, and brain, 
in a noninvasive manner (33, 34). 

The current paradigm for the biointegration 
of devices is to make the devices thin and 
stretchable for conformal attachment on the 
body. Here, we propose a different paradigm 
for biointegration: to robustly adhere thin and 
rigid devices on the body via a soft, tough, and 
bioadhesive couplant (Figs. 1 and 2). Developing 
bioadhesive couplants that enable electrical, 
optical, thermal, chemical, and/or biological 
interfacing with the body could open new 
avenues to future biointegrated devices. 

Constraints on the adjustment of tidal marshes to accelerating sea level rise 

Much uncertainty exists about the vulnerability of valuable tidal marsh ecosystems to relative sea level rise. 
Previous assessments of resilience to sea level rise, to which marshes can adjust by sediment accretion and 
elevation gain, revealed contrasting results, depending on contemporary or Holocene geological data. By 
analyzing globally distributed contemporary data, we found that marsh sediment accretion increases in parity 
with sea level rise, seemingly confirming previously claimed marsh resilience. However, subsidence of the 
substrate shows a nonlinear increase with accretion. As a result, marsh elevation gain is constrained in relation 
to sea level rise, and deficits emerge that are consistent with Holocene observations of tidal marsh vulnerability. 


Tidal marshes are among the most vulnerable 
of the world’s ecosystems. Throughout 
human civilization, tidal marshes 
have been reclaimed for agriculture and 
settlement, and the pace of loss has ac- 
celerated in concert with burgeoning coastal 
populations on all inhabited continents over 
the past century (1, 2). To this pressure has 
been added the threat of accelerating sea level 
rise. Because tidal marshes occur within tightly 
defined elevation ranges relative to mean sea 
level, they are sentinel ecosystems at the forefront 
of coastal climate change impact. Potential 
tidal marsh loss with sea level rise threatens 
a range of ecosystem services valued at 
~US$27 trillion per year (3), extending to fisheries 
production, recreation, cultural heritage, coastal 
protection, water quality enhancement, and 
carbon sequestration. 

Sea level rise can lead to marsh loss through 
marsh edge erosion, conversion to mudflats, 
encroachment of mangrove forests where they 
occupy lower tidal position, and/or the expansion 
of internal ponds and channels, with all 
mechanisms enhanced by the loss of marsh 
surface elevation relative to mean tide level (4). 

The fate of tidal marshes under accelerating 
sea level rise will be determined not only by 
opportunities for landward marsh migration 
(5) but also by the capacity of tidal marshes 
to gain elevation through vertical accretion 
of mineral sediment and organic matter 
(6). Biophysical feedbacks between sea level 
rise and the vertical development of marsh 
substrates reduce the risk of loss occurring 
through conversion to unvegetated mudflats 
or open water (7). 

Modeling based on observations from US 
East Coast organic marshes (8) and UK minerogenic 
marshes (9) has suggested that an 
equilibrium may emerge among the position 
of a marsh within the tidal frame, plant productivity, 
root mass development, sedimentation, 
and the elevation of the marsh in response to 
mean sea level (Fig. 1). This equilibrium may 
be sustained under low rates of relative sea 
level rise (RSLR), the combination of vertical 
land movement and sea level change. How 
widely these controls and their upper limits 
operate across marsh sites globally has been a 
crucial and disputed question in the regional- 
to global-scale modeling of tidal marsh responses 
to projected rates of RSLR under 
climate change (5, 7, 10). At present, observations 
of contemporary marsh accretion suggest 
that marshes can adjust to rates of RSLR of 
>10 mm year⁻¹ (7, 11, 12). However, the Holocene 
marsh record suggests that adjustment is highly 
unlikely (90% probability) at RSLR exceeding 
7 mm year⁻¹ for UK tidal marshes (13) and 
tropical mangroves (10), and 3 to 5 mm year⁻¹ for 
marshes in the Gulf of Mexico (14). Here, we 
report on contemporary tidal marsh elevation 
gain in relation to RSLR, testing the importance 
of environmental conditions in mediating 
these responses. 

Fig. 1. Processes influencing marsh surface elevation and their measurement using the surface 
elevation table-marker horizon (SET-MH) monitoring station. Feedbacks among sea level rise, vertical 
accretion, and elevation gain are conceptualized as driving marsh substrates toward an equilibrium 
elevation within the tidal frame, facilitated by inputs of mineral and organic matter. The SET-MH 
method measures soil elevation relative to a benchmark (to which the portable component of the SET 
is attached), while a tide gauge records the combined effect of changes in sea level and land movement 
occurring below the survey benchmark rod, to which the gauge is routinely leveled. This combined 
recording of eustatic sea level (ocean volume) change and deep land movement is termed relative sea 
level rise (RSLR) and does not include processes measured by the SET-MH method. GNSS, Global 
Navigation Satellite System; MERIS, Medium Resolution Imaging Spectrometer. 

Fig. 2. Distribution of tidal marsh SET-MH monitoring stations used in the analyses, and deficit 
between elevation gain and contemporaneous local RSLR. Deficits are assigned positive numbers. The 
background of late Holocene (0 to 3000 years B.P.) RSLR is derived from glacio-isostatic modeling (20). 

Several factors may influence the efficacy of 
tidal marsh vertical adjustment to sea level 
rise, but their relative contributions to explain- 
ing observed regional to global variability in 
marsh responses remain poorly elucidated. 
Globally, tidal range in marshes can vary from 
a few centimeters to 16 m, and this variability 
will influence the susceptibility of marshes to 
drowning, particularly where the tidal range is 
low relative to the projected RSLR (12, 15). The 
position within the tidal frame influences the 
depth and duration of inundation and the de- 
position of sediment, but it also influences 
mineral and organic accretion responses of the 
marsh vegetation occurring at these specific 
positions (8, 9). Tidal hydrodynamics and 
river discharge contribute to sediment delivery 
and accumulation (15), and these may be 
modified by flow control structures (16). Plant 
productivity is influenced by climate (precip- 
itation and temperature), salinity, nutrients 
(17), atmospheric CO₂, and vegetation composition, 
which in turn influence soil organic 
carbon accumulation and decomposition. The 
rate of RSLR varies between coastlines and 
continents, and millennial-scale variability in 
RSLR may also control soil organic content, 
which may increase with sea level rise when 
conditions are favorable (18). Sampling at 
regional to global scales across hydrogeomor- 
phic settings and biogeographic regions can 
clarify the relative importance of these factors 
and determine the consistency of feedbacks 
facilitating marsh adjustment to RSLR. 

Accurate measurements of tidal marsh ver- 
tical adjustment in relation to sea level require 
a fixed benchmark against which elevation 
gain or loss can be measured. To this end, the 
surface elevation table-marker horizon (SET- 
MH) method has been developed as a global 
standard (19) for monitoring tidal marsh re- 
sponses to RSLR (Fig. 1). A rod is driven into 
the marsh to form a stable benchmark against 
which elevation change can be measured. Ver- 
tical accretion (the surface accumulation of 
inorganic sediment, organic sediment, and 
living roots) is also measured at most sites 
above an artificial marker horizon (typically 
white feldspar, clay, or sand) introduced at 
the time of the first measurements against the 
benchmark (20). Data from SET-MH stations 
have informed models of marsh resilience to 
RSLR (7, 12), global projections of tidal marsh 
and mangrove change in the coming century 
(5, 21, 22), and the influence of vertical ac- 
cretion on carbon sequestration (23, 24). 

Fig. 3. The increasing vulnerability of tidal marshes to RSLR. (A and B) Accretion increases in parity
with the 50-year RSLR trend (A) and with marshes lower in the tidal frame (B). (C and D) However, the rate
of shallow marsh subsidence increases with the rate of vertical accretion, with an upward inflexion as RSLR
increases between 5 and 10 mm year⁻¹ (C), suppressing elevation adjustment to RSLR (D). (E and F) As
a result, the deficit between elevation gain and RSLR increases with the 50-year RSLR trend (E) and the
contemporaneous RSLR trend, the period over which individual SET-MH stations were measured (F). In
(B) and (C), points are colored for the 50-year RSLR trend in mm year⁻¹, and in (D) for estimated time to
failure (years) under the elevation deficit against the 50-year RSLR trend (20).

We analyzed tidal marsh elevation adjust-
ment in relation to RSLR from SET-MH monitoring
stations that met our criteria of emergent
tidal marsh vegetation, sufficient length of re- 
cord (>3 years), and exclusion from hydrological 
or experimental manipulation. The resulting 
network of 477 tidal marsh SET-MH stations, 
across 97 sites, forms clusters in regions with 
distinct hydrogeomorphic histories and tidal 
and biogeographic characteristics thought to be 
important to marsh resilience (Fig. 2). In gen- 
eral, Southern Hemisphere stations (Australia 
and South Africa) are in estuaries subject to 
millennia of stable or falling sea levels, with 
micro- to mesotidal marshes on high, stable 
intertidal platforms typically low in percentage
of soil organic carbon (table S1 and data
S1). North Atlantic coastlines (Bay of Fundy,
Canada; UK; Belgium) are predominantly
macrotidal, have been subject until recently to
relatively stable (<±0.5 mm year⁻¹) or falling
sea levels over the past few thousand years, 
have low soil organic carbon content, and are 
situated adjacent to waters with high total sus- 
pended matter. Coastlines in the network that 
are subject to low rates of sea level rise over the 
past few millennia include two large river deltas 
with a microtidal regime (the Mississippi, 
USA; the Ebro, Spain) and microtidal to 
mesotidal barrier estuaries and embayments 
(Venice Lagoon, Southern California; the US 
Gulf of Mexico chenier plains and estuaries). 

Fig. 4. Rates of relative sea level rise and marsh responses in the
observational and paleo record. (A) Histogram of SET-MH monitoring
stations showing elevation deficit, elevation surplus, and stability (parity)
with contemporaneous RSLR. (B) Modeled probability of an elevation deficit
with different rates of RSLR. (C) Histogram of paleo-marsh index points

The US Atlantic and North Pacific coastlines 
have been subject to relatively high rates of 
RSLR for several millennia, forming organic- 
rich marshes situated within barrier and 
embayment geomorphic settings. Contem- 
porary RSLR varies between coastlines as a 
result of vertical land movement and climate 
variability (25), with relatively high rates of 
RSLR occurring in the Gulf of Mexico, the US 
North Atlantic, and parts of the North Pacific 
coasts and lower RSLR in most European and 
Southern Hemisphere sites (data S1). 

This variability in RSLR allowed us to iden- 
tify RSLR influences on marsh surface eleva- 
tion change, and on mechanisms driving the 
latter, including vertical accretion and shallow 
subsidence. SET-MH stations were monitored 
for an average of 10.1 years (range: 3.5 to 
20.0 years) over periods for which RSLR at 
nearest tide gauges (hereafter “contemporaneous”
RSLR: x̄ = 6.81 ± 6.41 mm year⁻¹) was significantly
higher than both the 50-year average
(x̄ = 3.75 ± 2.73 mm year⁻¹; P < 0.001) and the
3000-year average derived from glacio-isostatic
modeling (20) (x̄ = 0.65 ± 0.72 mm year⁻¹; P <
0.001). This range of RSLR values encompassed
rates associated with marsh retreat in the
Holocene stratigraphic record (10, 13, 14).
We therefore tested three hypotheses con- 
cerning the feedback among RSLR, vertical 
accretion, and elevation gain: (i) The rate of 
vertical accretion would increase with RSLR; 
(ii) the rate of vertical accretion would correspond
to sediment availability; and (iii) vertical
accretion would correspond to marsh
elevation gain. We determined the extent to 
which these relationships are influenced by 
climatic, environmental, and edaphic conditions
(table S2), including soil bulk density
and organic carbon. 

showing positive, negative, and no tendency in relation to RSLR in UK
Holocene marshes (13). (D) Modeled probability of positive sea level
tendency (i.e., sinking within the tidal frame) associated with different rates
of Holocene RSLR. Numbers of observations for each RSLR increment are
shown at the base of each column.

The most important predictor of the rate
of vertical accretion at a global scale was the
50-year RSLR trend (r² = 0.48, P < 0.0001).
The observation of vertical accretion parity
with increased RSLR aligns with the predictions
of feedback models suggesting marsh
resilience to RSLR (7, 8, 12), although the 
relationship was stronger in organic than in 
minerogenic marshes (fig. S1). Marsh accretion 
across the network was higher at sites that are 
lower in the tidal frame [Fig. 3B; as measured 
by dimensionless D (20), an indicator of sub- 
mergence (26), P < 0.0001]. Annual average 
suspended matter in adjacent waters ex- 
plained less than 10% of global-scale variabil- 
ity in vertical accretion (fig. S2), and the 
incorporation of tidal range as an additional 
variable [as has recently been suggested (12)] 
did not improve the prediction of the rate of 
vertical accretion (linear regression r² = 0.03;
n = 410) or the r² [typically <0.01 for low
marshes (i.e., D > 0; n = 168)], contrary to
model projections (5, 7, 12). Vertical accretion 
on marsh surfaces in settings of low total sus- 
pended matter suggests an important role 
for accretion of autochthonous sediment (i.e., 
organic and/or locally resuspended mineral
matter). One caveat is that satellite-derived 
measures of suspended matter may not rep- 
resent sediment concentrations at the point 
of deposition, particularly in channelized estua- 
rine settings. However, hydrogeomorphic setting 
was also not strongly predictive of the rate of 
vertical accretion (fig. S2). 

Although vertical accretion was the most 
important control on surface elevation gain 
at the global scale (r² = 0.3; fig. S2), shallow
subsidence mediates the relationship between
vertical accretion and surface elevation gain
(27, 28) (Fig. 1 and Fig. 3C). Shallow subsidence
was greater under higher accretion rates (r² =
0.34, P < 0.0001; Fig. 3C and fig. S3) and higher
contemporaneous and 50-year RSLR (r² = 0.16,
P < 0.0001; fig. S3). As a result, on average less
than half of the sediment accreted above marker
horizons translated into surface elevation gain,
and this proportion decreased between 5 mm
year⁻¹ and 10 mm year⁻¹ of contemporaneous
RSLR (P < 0.0001). The deficit between surface
elevation gain and RSLR trend increased
linearly with RSLR in all settings (Fig. 3, E and 
F), as did the proportion of SET-MH monitor- 
ing stations subject to an elevation deficit (Fig. 4). 
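
The quantities discussed in this paragraph follow from three rates measured at each SET-MH station. The sketch below is illustrative only: the class, the function names, and the example numbers are hypothetical and are not taken from the study's data or code. It assumes the conventional SET-MH bookkeeping, in which shallow subsidence is vertical accretion minus surface elevation gain, and the elevation deficit is RSLR minus surface elevation gain.

```python
from dataclasses import dataclass


@dataclass
class SetMhStation:
    """Hypothetical record for one SET-MH station; all rates in mm per year."""
    accretion: float       # vertical accretion above the marker horizon
    elevation_gain: float  # surface elevation change against the benchmark rod
    rslr: float            # relative sea level rise at the nearest tide gauge


def shallow_subsidence(s: SetMhStation) -> float:
    # Accreted material that did not translate into elevation gain.
    return s.accretion - s.elevation_gain


def elevation_deficit(s: SetMhStation) -> float:
    # Positive when the marsh surface is rising more slowly than sea level.
    return s.rslr - s.elevation_gain


def accretion_efficiency(s: SetMhStation) -> float:
    # Fraction of vertical accretion that became surface elevation gain.
    return s.elevation_gain / s.accretion


if __name__ == "__main__":
    # Made-up example values, chosen only to exercise the functions.
    station = SetMhStation(accretion=8.0, elevation_gain=3.5, rslr=6.8)
    print(f"shallow subsidence: {shallow_subsidence(station):.2f} mm/yr")
    print(f"elevation deficit:  {elevation_deficit(station):.2f} mm/yr")
    print(f"efficiency:         {accretion_efficiency(station):.2f}")
```

An efficiency below 0.5 corresponds to the situation described above, in which less than half of the accreted sediment is retained as elevation gain.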

Marshes in the SET-MH network transition 
from a predominance of elevation surplus 
to elevation deficit over a similar range of 
RSLR, as has been historically observed in 
UK Holocene marshes (Fig. 4), which provide 
a record that is unique for the number of index
points associated with a range of RSLR his- 
tories (13). Contemporary observations from 
the tidal marsh SET-MH network, which ac- 
counts for shallow subsidence, were there- 
fore consistent with observations of tidal marsh 
and mangrove behavior during periods of 
relatively rapid sea level rise in the Holocene 
record (10, 13, 14, 22). Cumulative probabil- 
ities based on Bayesian modeling using the 
SET-MH record suggest that a drowning trajectory
is likely (66% probability) at 3.6 mm year⁻¹
and 4.6 mm year⁻¹ in UK Holocene marshes and
very likely (90% probability) at 7.6 mm year⁻¹ in
the SET-MH record and 7.1 mm year⁻¹ in UK
Holocene marshes (13) (Fig. 4). Although
several sites in the US Gulf and Atlantic coastlines
had a contemporary rate of elevation
gain exceeding 8 mm year⁻¹, these same sites
had the lowest median projected time to 
open-water conversion, as estimated by the 
time to reach minimum survival elevation (20) 
(table S5). The elevation subsidy provided by 
their proximity to eroding shorelines (fig. S4) 
may represent laterally migrating levees 
(29), a precursor to marsh failure (table S4 
and fig. S5). 

In locations where sea level has been stable 
(<±0.5 mm year⁻¹) or falling over recent millennia
(i.e., the macrotidal marshes of the
North Atlantic and the Southern Hemisphere
Australian and South African marshes), soil organic
carbon concentrations were significantly
lower (on average less than half) relative to 
marshes subject to millennial-scale RSLR (P < 
0.0001; table S1 and fig. S6). Gradually rising 
sea levels can both promote and preserve highly 
organic marshes (18, 30) by increasing plant 
productivity, increasing organic carbon burial, 
reducing oxidation, and slowing decomposi- 
tion. At a global scale, the proportion of organic 
carbon in accreting sediments across our network
was better explained by the 3000-year
RSLR trend (r² = 0.23; P < 0.0001) than by
contemporaneous RSLR (r² = 0.07; P < 0.0001;
fig. S6). Sites with higher bulk density and
lower percent organic carbon had lower rates
of subsidence (table S1 and fig. S1) and a higher
proportion of vertical accretion contributing to 
elevation gain, consistent with predictions (9). 
In these locations, shorelines were relatively 
stable and the proportion of vegetated to un- 
vegetated marsh cover (20) was high (table S1). 

The mechanisms promoting tidal marsh 
adjustment under low rates of sea level rise 
appear less effective under high rates of sea 
level rise. The substrates undergoing marsh 
elevation gain are increasingly subject to auto- 
compaction and subsidence under increased 
accretion and inundation depth. The elevation 
response is nonlinear, and above a primary 
breakpoint between 5 and 10 mm year⁻¹ of
vertical accretion, a higher proportional loss to 
subsidence constrains elevation adjustment in 
response to accelerating RSLR. This observa- 
tion reconciles the instrumental record with 
probabilities of tidal marsh adjustment emerg- 
ing from paleostratigraphic records during 
phases of high RSLR during the Holocene. 
Both datasets suggest a low likelihood (P < 
0.1) that tidal marshes will be retained in situ 
under global average rates of RSLR attained 
by mid-century under high-emissions scenarios, 
and by the end of the century under midrange- 
emissions scenarios (25). These rates of RSLR 
are already reached in subsiding deltas occu- 
pied by tidal marshes. Under these circum- 
stances, tidal marsh survival will increasingly 
depend on their upland migration. 

SPECIES INTERACTIONS

Pollinators of the sea: A discovery of animal-mediated fertilization in seaweed

E. Lavaut, M.-L. Guillemin, S. Colin, A. Faure, J. Coudret, C. Destombe, M. Valero


The long-held belief that animal-mediated pollination is absent in the sea has recently been contradicted in 
seagrasses, motivating investigations of other marine phyla. This is particularly relevant in red algae, in which 
female gametes are not liberated and male gametes are not flagellated. Using experiments with the isopod 
Idotea balthica and the red alga Gracilaria gracilis, we demonstrate that biotic interactions dramatically increase 
the fertilization success of the alga through animal transport of spermatia on their body. This discovery 
suggests that animal-mediated fertilization could have evolved independently in terrestrial and marine 
environments and raises the possibility of its emergence in the sea before plants moved ashore. 


Animal-mediated pollination in plants is
believed to have originated more than
140 million years ago (1) [although this
date still sparks debate (2)]. Insect pollination
is the most common form of
fertilization in flowering plants (3) and is usu- 
ally the unintended consequence of an ani- 
mal’s feeding activity. Although extensively 
studied on land, a demonstration of animal 
pollination in the marine environment was 
made in 2012 when foraging marine inverte- 
brates were shown to carry and transfer pollen 
grains from male to female flowers in the 
seagrass Thalassia testudinum K.D. Koenig 
(4, 5). This finding contradicted the long-held 
belief that animal-mediated pollination is ab- 
sent in the sea (6) and raised the question of 
its presence in other marine life, particularly 
in seaweeds. 

We investigated whether animal-mediated 
dissemination of male gametes occurs in 
seaweed. This question is particularly relevant 
in Florideophyceae because male gametes are 
not flagellated or motile, similar to pollen 
grains, and fertilization takes place on a re- 
mote female organ. The sperm (spermatia) 
cannot swim to attach to the trichogyne (the 
female receptive structure) and are typically 
thought to be dispersed only passively through 
water flows (7). 

The marine isopod Idotea balthica (Pallas,
1772) is commonly found firmly gripped to the
red alga (Fig. 1A and movie S1) Gracilaria
gracilis (Stackhouse) M. Steentoft, L.M. Irvine
& W.F. Farnham. Using two sets of experiments,
we tested whether I. balthica facilitates
male gamete dispersal and fertilization in this 
red alga species either indirectly, through the 
creation of water turbulence when swimming, 
or directly, by carrying spermatia on their 
bodies. Mating in G. gracilis occurs when 
spermatia released into the environment 
(Fig. 1B) encounter a trichogyne on the female 
thallus, where fertilization takes place. After 
fertilization, the zygote develops on the female 
alga into a complex structure visible to the 
naked eye called a cystocarp (fig. S1). We 
therefore used the number of cystocarps per 
centimeter of thallus as a proxy of fertiliza- 
tion success. 

In the first set of experiments, virgin 
reproductive-female thalli were placed in calm 
seawater aquaria 15 cm apart from reproductive- 
male thalli, either in the presence or absence 
of idotea isopods (Fig. 2, A and B, respec- 
tively). Fertilization success was about 20 times 
higher in the presence of idoteas than without 
(Wilcoxon rank-test, n = 5 replicates, P = 0.031) 
(Fig. 2C). In the second set of experiments, 
virgin females were placed in aquaria with or 
without idoteas that had been preincubated 
with reproductive-male thalli (Fig. 2, D and E, 
respectively). To control for the possibility that 
sperm contamination by the operator during 
animal transfer led to fertilization events, we 
performed at the beginning of the experiment 
shown in Fig. 2E a mock transfer that involved 
plunging the instruments used to handle idoteas. 
Only two cystocarps were detected—and in 
only one replicate—after the mock transfer (an 
average 0.002 cystocarp/cm) (table S1), indi- 
cating that such sperm contamination by the 
operator was negligible. Fertilization success 
rate reached 0.11 cystocarp/cm of thallus in the 
presence of incubated idoteas (Wilcoxon rank- 
test, n = 5 replicates, P = 0.028) (Fig. 2F), con- 
firming direct gamete transport by isopods. 
Idoteas were imaged by use of confocal laser 
scanning microscopy to demonstrate that they 
were indeed carrying spermatia. The presence of 
spermatia was revealed with staining (Fig. 1C). 
The cuticles of the animals were studded with
spermatia (Fig. 1D, fig. S2, and movie S2),
mainly adhering to the setae of pereiopods 
(Fig. 1E and movie S2) and on the margin seg- 
ments (Fig. 1F and movie S2) because of the 
sticky nature of the sperm mucilage (fig. S3). 
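
The Wilcoxon tests above use only n = 5 replicates, which limits how small an exact P value can be. The sketch below is an illustration under stated assumptions, not the authors' analysis: the text does not say which Wilcoxon variant was used, and the function name is hypothetical. Assuming a paired signed-rank test with no ties or zero differences, it enumerates every sign assignment to find the smallest attainable one-sided P value.

```python
from itertools import product


def min_one_sided_p(n: int) -> float:
    """Smallest exact one-sided P value of a Wilcoxon signed-rank test
    with n untied, nonzero paired differences (assumed test variant)."""
    ranks = range(1, n + 1)
    max_stat = sum(ranks)  # W+ when every difference is positive
    # Under the null hypothesis, each of the 2**n sign patterns is equally likely.
    stats = [
        sum(r for r, positive in zip(ranks, signs) if positive)
        for signs in product((False, True), repeat=n)
    ]
    extreme = sum(1 for w in stats if w >= max_stat)
    return extreme / len(stats)


if __name__ == "__main__":
    print(min_one_sided_p(5))  # 0.03125, i.e. 1/32
```

Under these assumptions, five replicates that all differ in the same direction give P = 1/32 ≈ 0.031, which is numerically consistent with the value reported for the first experiment. This is an inference from the reported numbers rather than something the text states.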

Our results demonstrate for the first time 
that biotic interactions dramatically increase 
the probability of fertilization in a seaweed. 
The observation of mucilage-embedded spermatia 
attached to idotea body parts after visiting male 
gametophytes suggests that idotea may serve 
as “pollinators” in G. gracilis upon subsequent 
visits to female individuals (movie S1). If we 
draw a parallel with the Cox-Knox postulates 
that define criteria for determination of pol- 
linators of flowering plants (8), our results are 
sufficient to prove that idoteas are efficient 
gamete vectors. 

This discovery is particularly relevant in 
Rhodophyta because male gametes are not 
flagellated and have a short life span (9), and 
more than 75% of species are dioicous (having 
male and female structures on different indi- 
viduals), which means that fertility success is 
distance dependent and limited by mate density
(10). Syngamy in red algae is expected to
be relatively rare compared with that of marine 
organisms that possess external fertilization, 
the synchronous release of motile gametes, 
and chemotaxis (such as invertebrates and 
brown seaweeds), unless compensatory mech- 
anisms increase sperm production, sperm trans- 
fer, or zygote survival. Searles hypothesized 
that the evolution of a life history in which 
the female nurtures and protects the zygote is 
probably a response to low rates of fertiliza- 
tion in both plants and red algae (the cysto- 
carps) (7). Despite debate concerning the 
compensatory allocation of resources for 
zygote survival in red algae, the possibility of a 
biotic interaction that facilitates fertilization 
success in this group had not been previously 
explored. In particular, animal-mediated trans- 
port of spermatia in red algae could dramat- 
ically increase fertilization success in calm 
rock pools (a condition that was mimicked 
by our experimental conditions) at low tide, 
when most fertilizations have been detected 
(11). However, the relative importance of animal- 
mediated transport versus water movements 
in fertilization success was not tested. Inter- 
tidal seaweed could combine the advantages 
of both water and animal dispersal of male 
gametes when alternating between low- and 
high-tide conditions. 

Although I. balthica grazes on other sea- 
weed, it does not feed directly on Gracilaria 
but rather eliminates epiphytes at the surface 
of the thallus while protecting itself from 
predators (12-14). We suggest that the rela- 
tionship could be mutually beneficial. For 
I. balthica, the seaweed provides shelter, and 
epiphytic diatoms found adhering to the sur- 
face of the thallus, whose frustules (or thecae) 
are found in the feces of idoteas (fig. S4),
appear to be an important food source. In 
return, G. gracilis benefits through increased 
growth rate owing to reduced fouling (15) and 
improved reproductive success. The benefit of 
animal-mediated “pollination” may be partic- 
ularly important in species such as G. gracilis 
with funnel-shaped trichogynes that do not 
protrude from the surface of the thallus (16),
making fertilization more difficult compared
with that of other Rhodophyta such as Bostrychia
moritziana (17), which have elongated trichogynes
(about 100 µm long). As in flowering
plants in which ambophily (adaptation to both
wind and animal pollination) appears to be 
more important than previously reported (18), 
our results demonstrate that in addition to the 
well-established role of water movement 
(7, 9, 10), animal-mediated fertilization is also 
present in the sea. This observation opens up 
new unexplored avenues of research aimed at 
understanding the ecology of these interactions. 

Fig. 1. The idoteas carry spermatia of G. gracilis
on their bodies. (A) I. balthica firmly clinging to the
reproductive-male thallus of G. gracilis (oblique
illumination, macroscopy) (movie S1). The light spots
on the thallus correspond to the male conceptacles.
(B) Release of spermatia (arrowhead) at the
surface of the reproductive-male thallus (oblique
illumination, macroscopy). (C) Spherical spermatia
about 4.5 µm in diameter (arrowheads) with their
noticeable bump [epifluorescence microscopy, staining
with the lectin Concanavalin A conjugated to
the fluorochrome (AlexaFluor488); ConA-AF488
binds to specific carbohydrates (24) on the surface
of the cell and mucilage]. (D) Three-dimensional
reconstruction from confocal laser scanning
microscopy of a young specimen of I. balthica
collected on a reproductive-male gametophyte
of G. gracilis [tegument of the isopod was fixed and
stained with calcofluor white, and the spermatia
were stained with ConA-AF488] (figs. S2 and S3
and movie S2). (E) Close-up of (D) right inset, showing
location of spermatia on the setae of pereiopods.
(F) Close-up of (D) left inset, showing location
of spermatia on the margin of segments (arrows).
Scale bars, (A) 1 mm; (B) and (D) 200 µm; and (C),
(E), and (F) 20 µm.

Animal pollination is generally thought to
have originated during the Mesozoic for gymnosperms
and angiosperms and was believed
absent in mosses and ferns, in which fertilization
is restricted to water (19-21). However,
transport of male gametes by animals has 
been demonstrated recently in mosses, ante- 
dating animal-mediated pollination at the early 
phase of land colonization (~ 450 million years 
ago) (21). The discovery of animal-mediated 
fertilization in Florideophyceae, whose origin 
was estimated to be between 817 million and 
1049 million years ago (22), suggests that it could 
have arisen with the diversification of the meta- 
zoan some 650 million years ago (23), even before 
plants moved ashore. However, we cannot rule 
out that different animal-mediated fertilization
mechanisms evolved independently and repeatedly
in terrestrial and marine environments.

Fig. 2 panel labels: (A) Males and females with idoteas; (B) Males and females without idoteas.


SOLAR CELLS

Inactive (PbI₂)₂RbCl stabilizes perovskite films for efficient solar cells

Yang Zhao, Fei Ma, Zihan Qu, Shiqi Yu, Tao Shen, Hui-Xiong Deng, Xinbo Chu, Xinxin Peng, Yongbo Yuan, Xingwang Zhang, Jingbi You

In halide perovskite solar cells the formation of secondary-phase excess lead iodide (PbI₂)
has some positive effects on power conversion efficiency (PCE) but can be detrimental to
device stability and lead to large hysteresis effects in voltage sweeps. We converted PbI₂ into
an inactive (PbI₂)₂RbCl compound by RbCl doping, which effectively stabilizes the perovskite
phase. We obtained a certified PCE of 25.6% for FAPbI₃ (FA, formamidinium) perovskite
solar cells on the basis of this strategy. Devices retained 96% of their original PCE values after
1000 hours of shelf storage and 80% after 500 hours of thermal stability testing at 85°C.


Several strategies can enhance the power
conversion efficiency (PCE) of perovskite
solar cells (PSCs), such as growing
films with high crystallinity (1, 2), doping
with anions, cations, or both (3-5),
engineering the charge transport layer (6-12),
and use of a universal passivation layer (13-16).
Several groups have reported certified PCEs
>25% by modulating the SnO₂ electron transport
layer (10-12).

Previous reports have shown that the secondary
phase of PbI₂ is critical for attaining
high performance levels (7, 12, 17-24). The
benefits of excess PbI₂, either in the bulk or
at the grain boundary of perovskite films,
have mainly been attributed to its passivation
effect resulting from the formation of a type-I
band alignment (24). However, excess PbI₂ is
too active and leads to instability of the device
and large hysteresis in current density-voltage
(J-V) characteristics (18, 23, 25, 26). Both factors
can be ascribed to the photodecomposition
of PbI₂ and the enhanced ionic migration.

It is crucial to manage excess unstable PbI₂
in perovskite films to achieve high efficiency.
Recently, Luo et al. developed a ligand-modulation
approach to manage the excess
PbI₂ and successfully improved both PCE (up
to 22%) and stability (23).

We propose stabilization of the perovskite
by converting excess PbI₂ into an inactive,
new, secondary-phase (PbI₂)₂RbCl that can
improve the stability of the perovskite FAPbI₃
layer (for simplicity, the perovskite film with
(PbI₂)₂RbCl is henceforth referred to as PIRC).
In addition to improving its optoelectronic
properties, the reduction of PbI₂ decreased the
bandgap of the perovskite layer. As a result,
perovskite solar cells with a 26.1% efficiency
(certified efficiency: 25.6%), >1000 hours of
shelf stability, and 500 hours of thermal stability
at 85°C have been developed.

We deposited nearly pure FAPbI₃ by a modified
two-step method used in our previous
reports (7, 15, 18). For deposition of FAPbI₃, the
PbI₂ seed layer was first grown on the substrate,
and the organic salt formamidinium
iodide (FAI) solution was then deposited on PbI₂
for diffusion and formation of perovskite by
annealing. Further, to improve the perovskite
crystallization, methylammonium chloride
(MACl) was added to the FAI solution (7, 27).
In contrast to our previous reports, to enhance
the thermal stability and decrease the bandgap
of perovskite as much as possible, the traditional
organic salts methylammonium iodide
(MAI) and methylammonium bromide were not
used in this study. We confirmed that a 5%
molar ratio of RbCl doping is the best condition
for device performance, as discussed below.

We performed scanning electron microscopy
(SEM) (Fig. 1, A and B) measurements to
determine the perovskite film crystallinity. For
the control film, the perovskite crystal grain size
was ~1 µm and the grains were surrounded by
PbI₂ grains (white needle shapes, Fig. 1A),
which formed as decomposition products
on the perovskite surface after heating at
elevated temperatures. For the PIRC films, the
perovskite crystal size was increased to ~2 µm.
Instead of a large fraction of evenly distributed
white needle-shaped grains, sporadically distributed
white irregular flakes were observed
(Fig. 1B). Energy dispersive spectroscopy mapping
of the PIRC film (fig. S1) revealed that
the white region contained Pb, I, Rb, and Cl,
but no N. This result indicated that there was
no organic-inorganic hybrid FAPbI₃ perovskite
phase in this region whereas a PbI₂/RbCl
mixed phase was possible (fig. S1 and tables S1
and S2). The cross-section SEM images showed
that the new secondary phases only formed on
top of the perovskite films (Fig. 1C).

Fig. 1. Microstructures and morphology of secondary-phase (PbI₂)₂RbCl on perovskite. (A and B) SEM
of perovskite films without RbCl and with 5% RbCl, respectively. Scale bars, 5 µm. (C) Cross-section SEM
image of a completed device using RbCl-doped perovskite. Scale bar, 1 µm. (D) XRD of perovskite without
RbCl and with 5% RbCl.

Conventional x-ray diffraction (XRD) for the
perovskite films (Fig. 1D) showed that the
main diffraction peak of the control film could
be assigned to the black phase of FAPbI₃
located at 14°. The yellow phase of FAPbI₃
(11.7°) was also observed in addition to the
PbI₂ diffraction peak (12.6°) (the inset of Fig. 1D
shows the enlarged figure). The PbI₂ diffraction
peak in the PIRC film was largely suppressed
and a diffraction peak (11.3°) emerged (Fig. 1D,
inset). Grazing-incidence wide-angle x-ray scattering
(GIWAXS) of the PIRC films was also
consistent with conventional XRD results (fig.
S2). We directly compared the XRD patterns of
the annealed PbI₂/RbCl precursor films with
different ratios (fig. S3), and we found that the
diffraction of PbI₂ almost disappeared whereas
the diffraction intensity at 11.3° was greatly
enhanced when the ratio of PbI₂/RbCl was 2.
Thus, we could attribute the 11.3° peak to crystalline
(PbI₂)₂RbCl. Density functional theory
calculations and powder diffraction results
also confirmed this assignment (fig. S4).
To study the phase stability of the perovskite
materials obtained, we aged the perovskite
films at 85°C for 48 hours. The XRD and SEM
results showed that more PbI₂ was formed after
heating of the control perovskite film (figs. S5
and S6). We found that the integrated XRD
intensity ratio of PbI₂/perovskite in the control
film increased from 0.42 to 0.98 after
aging whereas that of the PIRC film remained at ~0.10.
These results indicate that excess PbI₂ can
trigger decomposition of the perovskite layer
(18, 25, 26, 28, 29) whereas the perovskite
becomes much more stable after conversion
of PbI₂ into the (PbI₂)₂RbCl phase.

Fig. 2. Properties of control and PIRC perovskite films including ion
transport, composition, and photoluminescence. (A and B) Temperature-dependent
conductivity measurements of control and PIRC perovskite films,
respectively. (C to E) XPS spectra of Rb 3d, Cl 2p, and Pb 4f core energy
levels in control and PIRC films, respectively. (F) XPS depth profile of Cl in the
control and PIRC samples; the Rb depth in the PIRC sample is also included.
(G and H) Steady PL and TRPL of control perovskite film, PIRC film, and their
surface passivation films.

The presence of PbI, in the perovskite layer 
could lead to ion migration as a result of the 
formation of Schottky defects such as FA and I 
vacancies (30). We measured the temperature- 
dependent conductivity with a previously 
reported procedure (37) to confirm that ion 
migration is suppressed in the PIRC films 
through reduction of PbI, (Fig. 2, A and B). 
We can estimate the ion migration active en- 
ergy based on the Nernst-Einstein relation: 


oT = ooexp(—Eu/KksT) 


where 6g is a constant, kz is Boltzmann’s con- 
stant, o is ionic conductivity, T is temperature, 
and £, is the ion-migration activation energy. 
We derived E,, values from the slope of the In 
(oT) versus 1/T of ~0.32 eV for the control 


films and ~1.03 eV for the PIRC films. Given 
the E,, we can estimate that the ion migration 
rates (kX) at the operating temperature (50°C) 
are 4.2 x 10° s-! and 3.3 x 10°? s"!, respec- 
tively, based on the Arrhenius equation: 


ke = (ItpT)/h exp(—Ea/RT) 


where k_B stands for the Boltzmann constant,
T is temperature, ħ is the reduced Planck con-
stant, and R is the ideal gas constant (32).
We found that (PbI₂)₂RbCl could not easily
react with FA cations or I ions compared with
PbI₂ (fig. S7), implying that (PbI₂)₂RbCl,
unlike PbI₂, does not act as
a sink that absorbs adjacent FA and I during
heating, photo soaking, or voltage application.
This will suppress FA and I vacancy forma-
tion, as well as ion migration. Another possible
reason for the suppression of ion migration by
(PbI₂)₂RbCl is that, as a derivative of PbI₂,
(PbI₂)₂RbCl is a fully intercalated compound
(fig. S4) with narrow ion
migration pathways, whereas PbI₂ is a layer-
structured crystal with enough interlamellar
space for ion migration.
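
The two relations above can be evaluated numerically. The following is a minimal sketch, not the authors' analysis code: it fits the slope of ln(σT) versus 1/T to recover an activation energy, then evaluates the rate expression at 50°C for the quoted activation energies. The conductivity data are synthetic (generated from an assumed E_a of 0.32 eV and an arbitrary prefactor), and the activation energy is treated per ion in eV, so k_B replaces R in the exponent.

```python
import math

K_B_EV = 8.617333262e-5   # Boltzmann constant, eV/K
K_B_J = 1.380649e-23      # Boltzmann constant, J/K
HBAR = 1.054571817e-34    # reduced Planck constant, J*s


def activation_energy_ev(temps_k, sigmas):
    """Least-squares slope of ln(sigma*T) vs 1/T, converted to E_a in eV."""
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(s * t) for s, t in zip(sigmas, temps_k)]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return -(num / den) * K_B_EV  # slope = -E_a / k_B


def migration_rate(e_a_ev, temp_k):
    """k = (k_B T / hbar) * exp(-E_a / (k_B T)), E_a per ion in eV."""
    prefactor = K_B_J * temp_k / HBAR
    return prefactor * math.exp(-e_a_ev / (K_B_EV * temp_k))


# Synthetic conductivities (arbitrary units), NOT measured data.
temps = [280.0, 300.0, 320.0, 340.0]
sigma0 = 1.0e3  # hypothetical prefactor
synthetic = [sigma0 / t * math.exp(-0.32 / (K_B_EV * t)) for t in temps]
ea_fit = activation_energy_ev(temps, synthetic)

T_OP = 323.15  # 50 degrees C in kelvin
rate_control = migration_rate(0.32, T_OP)
rate_pirc = migration_rate(1.03, T_OP)
print(f"fitted E_a = {ea_fit:.3f} eV")
print(f"rate ratio control/PIRC = {rate_control / rate_pirc:.2e}")
```

Because the rate depends exponentially on E_a, rounding the activation energies to two decimals shifts the absolute rates noticeably; the ratio between the two films is the more robust quantity.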

X-ray photoelectron spectroscopy (XPS) char- 
acterization of the PIRC film showed the ex- 
pected Rb 3d core level peak (Fig. 2C). For Cl, a 
substantial difference was found between the 
control and PIRC films. We estimated ~2.7% Cl
in the control film (Fig. 2D), a residue of the
MACl added during perovskite
film growth, although most MACl would have
evaporated during annealing. In these perov-
skite films, the radius of Cl was too small to
incorporate into the lattice. We estimated that
~5.3% Cl was left in the PIRC film that was 
bonded with Pb or Rb ions and could not be 
easily evaporated (Fig. 2D). This excess Cl 
could stabilize the perovskite phase through 
strong Pb-Cl bonding (8-10). The presence of 
RbCl also enhanced the formation of the black
perovskite phase (Fig. 1D). Previous studies
showed that the metallic Pb formed from PbI₂
under high-energy illumination drives loss of I₂
(28, 29). We found that the metallic state of Pb
was suppressed after introducing PIRC (Fig. 2E),
which could be due to the much better photosta-
bility of PIRC compared with PbI₂ (fig. S8).

Depth profile XPS for the PIRC films re- 
vealed that the top surface was Rb- and Cl-rich 
(Fig. 2F and fig. S9), which could be a result of 
the secondary phase having a different solu-
bility in the precursor solvent. Time-of-flight
secondary ion mass spectrometry (ToF-SIMS) of
the control and target perovskite films (fig. S10)
also confirmed that the Rb and Cl that convert PbI₂
into (PbI₂)₂RbCl were mainly distributed on
top of the perovskite. The Cl ratio on the bottom 
of the perovskite is about 1% higher than that in 
the bulk estimated from ToF-SIMS (fig. S10) and 
XPS results (Fig. 2F). 

Excess PbI₂ can increase the perovskite band-
gap (7, 12, 17), which could be attributed to the
strong quantum confinement in perovskite/PbI₂.
The photoluminescence (PL) emission peak red-shifted
from 805 nm in the control film to 810 nm in
the PIRC film (Fig. 2G). The PIRC perovskite
film showed enhanced PL intensity indicative 
of suppression of nonradiative recombination. 
Time-resolved photoluminescence (TRPL)
showed that the lifetime of the perovskite in-
creased from 0.98 μs to 2.3 μs for the PIRC
films and that this lifetime can be extended to
~6 μs with the addition of a passivation layer
on the perovskite surface (Fig. 2H).

We fabricated the solar cell devices in an n-i-p
configuration of FTO/SnO₂/perovskite/passivation
layer/Spiro-OMeTAD/Au [FTO, fluorine-doped
tin oxide; Spiro-OMeTAD, 2,2′,7,7′-tetrakis(N,N-
di-p-methoxyphenylamine)-9,9′-spirobifluorene;
Fig. 1C]. After testing the device performance
using perovskite films with 1 to 10% RbCl
doping, we found that the 5% RbCl device
had the best performance (fig. S11 and table
S3); additionally, improved performance was
observed when Cl was used as the halide (fig.
S12 and table S4). A representative J-V curve
for the control and 5% RbCl-doped device
showed that each of the photovoltaic param-
eters increased (Fig. 3A). The typical short-
circuit current density (Jsc) increased from
25.1 to 25.8 mA/cm², which could be attributed
mainly to the lower bandgap (Fig. 2G) as well as
the external quantum efficiency (EQE) (Fig. 3B).
The open-circuit voltage (Voc) increased from 1.17
to 1.19 V despite the bandgap decrease, owing
to the suppression of recombination. The fill
factor (FF) was almost the same or slightly
improved compared with the control device.

Overall, the PCE increased from 24.6 to 
25.6%. In addition, the PIRC-based device was 
almost hysteresis-free whereas the control de-
vice showed obvious hysteresis. The highest PCE
for an optimized device tested in our laboratory
was 26.1% (Fig. 3C); this device also showed
25.7% efficiency and steady output for 1200 s
(device area: 0.108 cm²; measurement mask
size: 0.0741 cm²). The statistics of the PCE dis-
tribution confirm the reproducibility of our
results (fig. S13). These devices were sent to an
accredited laboratory in Newport, USA, for
certification and showed a certified efficiency
of 25.6% with a Voc of 1.182 V, a Jsc of
26.3 mA/cm², and a FF of 82.7% (figs. S14 and S15).

The Voc reflects the recombination rate of 
a given device and can be measured through 
the EQE at short circuit current conditions 
when the device is tested as a light-emitting 
diode. These devices had electroluminescence 
(EL) efficiencies as high as 21.6% under the 
injection current of 26 mA/cm² (equal to the short-
circuit current Jsc) (Fig. 3D). High quantum
efficiency can only be observed through charge 
confinement in light-emitting diodes, so the 
high-EL efficiency indicates that the defect- 
related nonradiative recombination was largely 
suppressed in the tested device. Light intensity-
dependent results (fig. S16) also confirm this
conclusion (33) and are consistent with our de-
vice simultaneously delivering high Jsc and Voc.

We tracked device shelf stability and hyster- 
esis in a nitrogen environment while using the 
Spiro-OMeTAD as the hole transport layer. 
After 1004 hours of aging, a slight drop in the 
FF from 82.6 to 80.6% was observed for the
PIRC strategy-based device, which could be
attributed to the reduction of Spiro-OMeTAD
in an oxygen-free environment (34, 35); fur-
ther, it retained 96% of its original PCE. The device
maintained almost the same PCE in both 
reverse and forward scans (Fig. 3E). However, 
the control device could only maintain its re- 
verse scan PCE of 24.1% whereas the forward 
efficiency was reduced to 22.9%. We attributed 
this enhanced hysteresis (fig. S17 and table S5) 
to the spontaneous formation of PbI₂ as a re-
sult of aging and enhanced ion migration.

We also tested accelerated aging stability while 
heating the devices at 85°C in a nitrogen envi- 
ronment. We removed other possible sources of 
instability such as the organic passivation layer 
and replaced Spiro-OMeTAD with Poly[bis(4- 
phenyl)(2,4,6-trimethylphenyl)amine]. We found 
that the PIRC-targeted devices primarily showed 
improved stability. Under 500 hours of contin- 
uous heating, the device maintained 80% initial 
performance (Fig. 3F and table S6) whereas the 
control device only retained 50% of its initial 
PCE. The PIRC method proposed in this study 
could enhance both stability and device effi-
ciency, which will provide a new direction and
push the development of perovskite solar cells;
there is also potential for use in other opto- 
electronic devices such as light-emitting diodes 
and photodetectors. 

CARBON NANOTUBES


DNA-guided lattice remodeling of carbon nanotubes 


Zhiwei Lin, Leticia C. Beltran, Zeus A. De los Santos, Yinong Li, Tehseen Adel,
Jeffrey A. Fagan, Angela R. Hight Walker, Edward H. Egelman, Ming Zheng


Covalent modification of carbon nanotubes is a promising strategy for engineering their electronic 
structures. However, keeping modification sites in registration with a nanotube lattice is challenging. 
We report a solution using DNA-directed, guanine (G)-specific cross-linking chemistry. Through DNA 
screening we identify a sequence, C3GC7GC3, whose reaction with an (8,3) enantiomer yields minimum 
disorder-induced Raman mode intensities and photoluminescence Stokes shift, suggesting ordered 
defect array formation. Single-particle cryo-electron microscopy shows that the C3GC7GC3
functionalized (8,3) has an ordered helical structure with a 6.5-angstrom periodicity. Reaction
mechanism analysis suggests that the helical periodicity arises from an array of G-modified carbon-
carbon bonds separated by a fixed distance along an armchair helical line. Our findings may be
used to remodel nanotube lattices for novel electronic properties.


Synthesis of carbon-based quantum ma-
terials can, in principle, employ rich or-
ganic chemistry to realize novel properties
through atomic-precision structural engi-
neering. More than 50 years ago, W. A.
Little proposed a room-temperature organic
superconductor model composed of a one-
dimensional conducting chain with an array
of polarizable side chains attached (1). Since
then, numerous efforts have been made to verify
Little's proposal, but all have failed (2).
In 2016 the mechanism underlying Little’s 
model—electron attraction mediated by po- 
larizable groups—was confirmed for the first 
time (3). The study used a single-wall carbon 
nanotube (SWCNT) as the one-dimensional 
conducting chain, along which a nanotube 
circuitry was constructed to provide a single 
polarizable “side chain.” This work suggests 
a route to the Little model by chemically im- 
planting polarizable groups in registration 
with a SWCNT lattice (4). However, there ex- 
ists a major challenge in controlling reaction 
sites along the nanotube, which seems in- 
surmountable as half the population of car- 
bon atoms on a given SWCNT are chemically 
equivalent and are enantiomers of the remain- 
ing half. We report a DNA-guided chemical 
reaction to overcome this challenge. We screen 
reaction products by resonance Raman and 
photoluminescence (PL) spectroscopy and 
inspect the structures of promising candidates 
by single-particle cryo-electron microscopy 
(cryo-EM). Our findings demonstrate the fea-
sibility of not only creating a wide range of
SWCNT derivatives in general but also build- 
ing a Little model in particular. 

Broadly speaking, covalent modification of 
SWCNTs is a promising route toward organic 
quantum materials (5-7). With all atoms on 
their surfaces, SWCNTs are more amenable 
than other solid state materials to precision 
molecular engineering by wet chemistry. In 
addition, various chiral forms of SWCNTs made 
available by sorting (8) offer a diverse range 
of electronic structures for further chemical 
tailoring. Recently, Weisman et al. reported
a photochemical reaction of DNA-wrapped 
SWCNTs with singlet oxygen that covalently 
links guanine (G) to the side wall of SWCNTs 
(6). Even though the chemical nature of the 
covalent link was not fully revealed and the 
structure of the reaction product was rather 
disordered (9), we are nevertheless motivated 
to pursue ordered SWCNT lattice modifica- 
tion with the goal of finding more effective 
ways to explore nanotube chirality and DNA 
sequence space. 

To speed up sequence screening and to pro- 
mote ordered structure formation, we have 
explored conditions for the guanine func- 
tionalization reaction of SWCNTs. We find 
that the previously reported aqueous phase 
photochemical reaction of Rose Bengal (RB)- 
mediated guanine cross-linking with SWCNTs 
(6) also works in mixed solvents containing 
both methanol and water. In 50% v/v metha-
nol, the reaction proceeds more slowly (60 min)
than in water (15 min); further, consump-
tion of the RB sensitizer is decreased considerably,
and RB binding to SWCNTs, which may ad-
versely affect DNA wrapping structures and
cross-linking sites, is also minimized. All these
changes should favor homogeneous product
formation. Combining this new reaction con-
dition with a previously established process
(10) for DNA-surfactant exchange, we have de-
vised a one-pot chemistry (Fig. 1A and figs.
S1 to S5) that can efficiently react any G-
containing sequence with any single-chirality 
SWCNT species [e.g., (6,5), (9,1), (8,3), and so 
on] purified independently through various 
techniques (8). Excess DNA is present during 
the reaction to refill exposed nanotube sur- 
faces arising from reaction-induced DNA struc- 
ture contraction, eliminating a potential source 
of inhomogeneous functionalization. 

We have employed spectroscopic tools to 
monitor the reaction (Fig. 1 and figs. S6 and 
S7). Figures 1B to 1D present a data set from a 
reaction on an (8,3) enantiomer (normalized 
circular dichroism signal at the E45 = - 42 mdeg, 
which according to theoretical analysis cor-
responds to a right-handed enantiomer) (11).
Hereafter we denote the enantiomer simply 
as (8,3) unless indicated otherwise. Figures
1B and 1C respectively show the expected red
shift of the E11 absorption and PL peaks after
functionalization. Figure 1D shows resonance 
Raman spectra of unfunctionalized (black 
trace) and functionalized (8,3) (red trace). In 
addition to the well-documented D peak, we 
find another disorder-induced peak: the in- 
termediate frequency mode (IFM) (12, 13) at 
387 cm⁻¹, with overtone 2IFM at 775 cm⁻¹ and
combination modes with D, D ± IFM, at 905
and 1680 cm⁻¹, respectively. There is consid-
erable intensity enhancement of the D and
IFM modes in the functionalized tubes. By
contrast, the 2IFM and D ± IFM peaks remain
weak and unchanged after functionalization, 
consistent with their origin from two-phonon, 
second-order scattering processes (12). 

To gain insight into the reaction mecha- 
nism, we have functionalized (8,3) using a 
set of DNA sequences with varying G content. 
With increasing G content the functionalized 
(8,3) shows a gradual increase in both D and 
IFM peak intensity (Fig. 2, A and B) in ad- 
dition to more red-shifted and broadened PL 
and absorbance peaks (Fig. 2, C and D). These 
observations are consistent with a previous 
study which used mixed chirality tubes (6). 
However, using chirality-pure SWCNTs allows 
us to unambiguously determine the absorp- 
tion and PL peak positions of the functional- 
ized tube as well as calculate its absorption 
peak shift and Stokes shift (SS), i.e., the energy
difference between the absorption and PL
peaks, as a function of G content or defect den-
sity. The absorption peak shift increases with G
content (Fig. 2E), but its amplitude is <27 meV,
about one-tenth of that typically ob-
served for an sp³ defect (14). This difference
is notable considering that the defect density
(≈10⁰/nm) in guanine-modified tubes (15) is
about two orders of magnitude higher than
that in sp³-modified tubes (5). The SS also
increases with defect density (Fig. 2E), a trend
opposite that observed for sp³ defects (14). In
addition, we find a quantitative relationship 
between SS and full width at half maximum 

Fig. 1. One-pot photochemical reaction scheme and spectral characteriza-
tion. (A) Reaction scheme starting from resolving DNA-wrapped to guanine-
functionalized (guanine-lized) SWCNT carried out in a single pot. DOC, sodium
deoxycholate. (B) Absorption (normalized at E33). (C) PL (with peak intensity


(W) of the PL peaks, suggesting that modified
and unmodified carbon atoms are isovalent as
discussed below. In Fig. 2F we plot SS versus
W² and fit the data well (R² = 0.98) with T =
349.3 K using Eq. 1, where k is the Boltzmann
constant and T is the effective exciton temper-
ature, equal to or above the ambient temperature.

$$\mathrm{SS} = \frac{W^2}{8 \ln 2 \, kT} \approx 0.18\,\frac{W^2}{kT} \qquad (1)$$

Eq. 1 has been used to describe excitons in
two-dimensional (2D) quantum wells and
3D alloy semiconductors in which disordered
isovalent substitution creates shallow traps
(16, 17). It attributes the observed SS to the ther-
malization of excitons in an inhomogeneously
broadened band. Combining all of these spec-
troscopic observations, we exclude sp³ defect
formation by guanine functionalization and
conclude that the chemistry employed creates
a modified sp² defect that is isovalent to the
original sp² carbon in the pristine SWCNT.
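
The relation between Stokes shift and linewidth for thermalized excitons in a Gaussian-broadened band can be checked with a few lines of code. This is a minimal sketch under stated assumptions: it takes SS = W²/(8 ln 2 · kT) with W the full width at half maximum, fits a straight line through the origin to SS versus W², and converts the slope to an effective temperature. The linewidths are hypothetical values chosen for illustration, and the through-origin fit is an assumption, not a description of how the reported fit was done.

```python
import math

K_B_MEV = 8.617333262e-2  # Boltzmann constant, meV/K


def stokes_shift_mev(fwhm_mev, temp_k):
    """SS = W^2 / (8 ln2 * k * T), with W and SS in meV."""
    return fwhm_mev ** 2 / (8.0 * math.log(2.0) * K_B_MEV * temp_k)


def effective_temperature_k(fwhm_mev, shifts_mev):
    """Fit SS = slope * W^2 through the origin, then T = 1 / (8 ln2 * k * slope)."""
    w2 = [w ** 2 for w in fwhm_mev]
    slope = sum(x * y for x, y in zip(w2, shifts_mev)) / sum(x * x for x in w2)
    return 1.0 / (8.0 * math.log(2.0) * K_B_MEV * slope)


# Hypothetical linewidths (meV); shifts are generated from the model itself.
widths = [30.0, 40.0, 50.0, 60.0]
shifts = [stokes_shift_mev(w, 349.3) for w in widths]
t_fit = effective_temperature_k(widths, shifts)
coefficient = 1.0 / (8.0 * math.log(2.0))
print(f"recovered T = {t_fit:.1f} K, 1/(8 ln 2) = {coefficient:.3f}")
```

The factor 8 ln 2 comes from converting the Gaussian variance to a full width at half maximum, W = 2·sqrt(2 ln 2)·σ.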

To differentiate the two types of disorders 
originating from the defect density itself and 
defect distribution pattern, we have designed a 
set of 15-mer G/C sequences containing two Gs
separated by a varying number of Cs (Fig. 3A) 
for the (8,3) functionalization chemistry. We 
find that disorder-induced D and IFM peak 
intensities change as a function of inter-G spac- 
ing, reaching an unexpectedly deep minimum 
when the spacing is 7 (Fig. 3, B and C). The SS 
and PL peak linewidth also show minimum 
values at that spacing (Fig. 3, D and E). These 
observations appear to be dependent on nano- 
tube chirality as left- and right-handed (6,5)s 
functionalized by the same set of sequences 
yield different spectral patterns (fig. S7). 

We offer a qualitative analysis of the data 
shown in Fig. 3. According to the mechanism 
of defect-induced Raman modes (12, 18), the
observed peak intensity is proportional to the 
extent of elastic electron scattering by defects. 
For the sequences used in our study we esti- 
mate that a SWCNT contains 1 to 5 guanine- 
modified sites per nanometer tube length based 
on molecular dynamics simulations of a typi- 
cal DNA wrapping structure on a SWCNT 
(15). Because excitons are 2 to 3 nm in size 
(19) electron scattering is expected to involve 




normalized to 1). (D) Resonance Raman spectra (normalized at the “G” peak) of 
(8,3) before and after the reaction. Spectra in (C) and (D) are measured with 671 nm 
excitation corresponding to E22 of (8,3). The DNA sequence used in this experiment 
is (GCC)12. See fig. S8 for data shown in (B) and (C) before normalization. 


multiple defect sites and is sensitive to not 
only the defect density but also the degree of 
order in the defect array. The minimum D and 
IFM peak intensities shown in Fig. 3 are thus 
interpreted as resulting from an ordered defect 
array generated by the 2G-7 sequence. This 
conclusion is also consistent with the observed 
minimum SS for the same sequence, as SS is 
another measure of disorder for semiconduc-
tors with isovalent substitution (16, 17).

To independently evaluate the spectroscopy- 
derived result, we have applied single-particle 
cryo-EM to measure a 2G-7-(8,3) hybrid struc- 
ture before and after functionalization. Cryo-EM 
imaging of 2G-7 functionalized (8,3) reveals 
filaments ~20 Å in diameter (fig. S11). In the
nonfunctionalized structure coated with 2G-7, 
an averaged power spectrum from ~ 2 x 10° 
images of particle segments yields no detect- 
able features (fig. S12), implying a disordered 
DNA wrapping structure. In the functionalized 
structure we have detected a layer line pattern 
characteristic of a one-start helix with a 6.5 Å
helical pitch visible from the averaged power 
spectrum of selected segments (Fig. 4A). The 

Fig. 2. Spectroscopic characterization of (8,3) functionalized with DNA of
varying G content (see table S1 for sequence information). (A and B) IFM
and D peak intensity (normalized by the intensity of the “G” peak) profile. (C and
D) PL and absorbance profile. (E) Absorbance shift and SS profile. (F) SS versus
W² and a linear fit (R² = 0.98) using Eq. 1 with T = 349.3 K. Error bars shown here
and elsewhere in this work are derived from three independently measured
values for each of three independently prepared samples. Spectra in (A) and (C)
are measured with 671 nm excitation. See fig. S9 for original data.


helical structure is also visible in the 2D class 
average (Fig. 4B). Ab initio reconstruction—an 
unbiased and reference-free approach—was 
employed to generate a 3D reconstruction of 
the DNA-coated nanotube (Fig. 4C). Figure 4D 
shows a highly averaged version of that recon- 
struction, with an atomic model for the SWCNT. 
Because of the low signal-to-noise ratio in these 
cryo-EM images and the apparent lack of any 
other periodicities present, we have imposed a 
helical averaging of the density along the 6.5 Å
pitch helix. The unfiltered 3D map (Fig. 4C) 
shows a coherence length of ~10 nm, shorter 
than the full length of the segments used. The 
density in Fig. 4, C and D, is shown as a left- 
handed helix although true handedness was 
not determined. 

The 6.5 Å periodicity observed by cryo-EM
provides support for the model of ordered de- 
fect array formation and is also a valuable clue 
regarding the mechanism of guanine func- 
tionalization. As discussed earlier, a correct 
reaction mechanism should yield a modified 
carbon that largely maintains its sp² char-
acter. Indeed, such chemistry has been well
documented (20, 21) and forms the basis for
our proposal as shown in Fig. 4, E and F, and 
fig. S14B. Guanine oxidation by singlet oxygen 
has a plethora of pathways and products de-
pending on reaction conditions and structural
context, but a common initial step is oxidation
of the C8 carbon on the imidazole ring (22). We
propose that the C8 carbon becomes electro-
philic upon oxidation and then undergoes a
[2+1] cycloaddition with a nearby C-C bond on
a SWCNT to yield a three-membered ring.
This is followed by C-C bond cleavage and
concomitant ring opening as a result of ring
strain, leaving C8 to bridge the two carbons
from the SWCNT and restore their sp² char-
acter. Theoretical calculations (20) predict that
this type of reaction is most favorable on C-C 
bonds with large curvatures or bond strains, 
consistent with a previous observation from 
the guanine functionalization chemistry [figure 
2C in (6)]. In (8,3) there are three types of C-C 
bonds with distinct curvatures; of these, C-C 
bonds along the helical armchair line shown in 
Fig. 4E possess the largest curvature. We note 
that the pitch (p) of this helical line is an
intrinsic length scale of (8,3) determined solely
by its chiral index (n, m) and C-C bond length
a_cc:

$$p = a_{cc}\,\frac{n - m}{n + m}\,\sqrt{n^2 + m^2 + nm} = 6.45\ \text{Å}$$

(for a_cc = 1.44 Å), matching that observed by
cryo-EM. We therefore propose a 2G-7 wrap- 
ping structure in which each of the 2Gs is 
covalently linked to a C-C bond along the arm- 
chair line, resulting in pinning of the DNA back- 


bone along the same armchair line. Consistent 
with this functionalization-induced DNA pin- 
ning is our observation that 2G-5 and 2G-6 
functionalized (8,3) also exhibit the same 6.5 Å
helical pitch (fig. S13). Manual model build-
ing for 2G-7 functionalized (8,3) followed
by energy minimization yields a model as 
shown in fig. S14A, in which two adjacent 
G modification sites are separated by five 
C-C bonds along the armchair line. This equal 
G spacing explains the minimum spectroscopy- 
derived structure disorder for 2G-7 function- 
alized (8,3). 
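
The 6.45 Å pitch quoted above can be reproduced from nanotube geometry alone. The sketch below is an illustration under one assumption: that the pitch is the axial advance of an armchair line per full turn around the tube, i.e., the circumference times the tangent of the angle between the armchair direction and the circumferential (chiral) vector. The function names are hypothetical. It also evaluates an equivalent closed form in n, m, and the C-C bond length.

```python
import math


def armchair_helix_pitch(n, m, a_cc=1.44):
    """Axial advance per turn (in angstroms) of an armchair line on an (n, m) tube."""
    # Tube circumference: |C_h| = sqrt(3) * a_cc * sqrt(n^2 + n*m + m^2)
    circumference = math.sqrt(3.0) * a_cc * math.sqrt(n * n + n * m + m * m)
    # Chiral angle measured from the zigzag direction
    chiral_angle = math.atan2(math.sqrt(3.0) * m, 2 * n + m)
    # The armchair direction lies 30 degrees from zigzag, so it makes an
    # angle of (30 deg - chiral angle) with the circumferential direction.
    return circumference * math.tan(math.pi / 6.0 - chiral_angle)


def armchair_helix_pitch_closed_form(n, m, a_cc=1.44):
    """Same quantity after simplifying the tangent: a_cc*(n-m)/(n+m)*sqrt(n^2+nm+m^2)."""
    return a_cc * (n - m) / (n + m) * math.sqrt(n * n + n * m + m * m)


pitch_83 = armchair_helix_pitch(8, 3)
pitch_83_closed = armchair_helix_pitch_closed_form(8, 3)
print(f"(8,3) armchair-line pitch: {pitch_83:.2f} A")
```

For an armchair tube (n = m) the armchair line closes on itself around the circumference and the pitch is zero, which both forms reproduce.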

In summary, we show that ordered SWCNT 
modification can be achieved by taking ad- 
vantage of DNA sequence control over the 
spacing between adjacent reaction sites, and 
SWCNT’s bond curvature-dependent reac- 
tivity. Our findings demonstrate the chemical 
feasibility of building a Little model. DNA- 
guided remodeling breaks the original sym- 
metry of the nanotube lattice and therefore 
should lead to new modes of low-energy elec- 
tronic excitation. Theoretical analysis shows 
that helical modification of a SWCNT in reg- 
istry with its lattice may induce topological 
electronic behavior (23, 24), suggesting that 
the chemistry we report here might be used 
to explore topological physics. We envision 

Fig. 3. Screening DNA sequences for the ordered defect array. (A) DNA sequence used for the screening. (B) D and IFM peak intensity profile. (C) Raman spectra
of 2G-5, 2G-6, and 2G-7 functionalized (8,3) (normalized by the “G” peak). (D) SS and W profile. (E) PL spectra of the three samples. Spectra in (C) and (E) are
measured with 671 nm excitation. See fig. S10 for original data.

Fig. 4. Cryo-EM-derived structure model for 2G-7 functionalized (8,3) and reaction mechanism. (A) Averaged power spectrum from ~44,000 particles. The red
arrow points to a layer line with a spacing of 1/(6.5 Å) from the equator. (B) Image of 2D class average. (C) Low-resolution 3D map generated by an unbiased,
reference-free approach, displaying coherence that extends over an axial length of ≈10 nm. (D) 3D map corresponding to averaged density along the 6.5 Å pitch helix.
(E) Carbon-carbon bonds that have maximum bond curvature, highlighted in blue along an armchair helical line of (8,3), along which every sixth carbon-carbon
bond is modified by a guanine (red balls) according to modeling (fig. S14A). (F) A proposed reaction mechanism.

that future work will introduce diversity of
functional groups, increase the coherence length
of ordered modification, and eventually enable 
discovery of organic quantum materials and 
carbon-based, metal-free catalysts (25, 26). 

SOLID-STATE IONICS

Nanosecond protonic programmable resistors for analog deep learning

Murat Onen, Nicolas Emond, Baoming Wang, Difei Zhang, Frances M. Ross, Ju Li,

Nanoscale ionic programmable resistors for analog deep learning are 1000 times smaller than
biological cells, but it is not yet clear how much faster they can be relative to neurons and synapses. 
Scaling analyses of ionic transport and charge-transfer reaction rates point to operation in the 
nonlinear regime, where extreme electric fields are present within the solid electrolyte and its 
interfaces. In this work, we generated silicon-compatible nanoscale protonic programmable resistors 
with highly desirable characteristics under extreme electric fields. This operation regime enabled 
controlled shuttling and intercalation of protons in nanoseconds at room temperature in an energy- 
efficient manner. The devices showed symmetric, linear, and reversible modulation characteristics with 
many conductance states covering a 20x dynamic range. Thus, the space-time-energy performance 
of the all-solid-state artificial synapses can greatly exceed that of their biological counterparts.


Aqueous ionics with mobile Na⁺, K⁺, Ca²⁺,
and other ions underpin biological in-
formation processing. In neurons and
synapses, the action potential of mag-
nitude ~100 mV evolves over a char-
acteristic time scale of milliseconds, which
fundamentally limits the speed of thinking 
and reflexes of animals. Because liquid water 
decomposes at voltages >1.23 V, a weak action 
potential is understandable. However, with 
human-made solid-state neurons and syn- 
apses (1-3) for analog machine learning, re- 
search efforts are no longer limited by the 
stability window of aqueous electrolyte. Fur- 
thermore, it is also possible to fabricate de- 
vices that are much smaller than biological 
neurons, by a length-scale shrinkage factor 
of 10³ (from ~10 μm to ~10 nm). This pos-
sibility raises a fundamental question about
how much faster we can train such “artificial 
synapses”—i.e., what is the ultimate speed 
limit to solid-state ionics-based analog deep 
learning? Also, when we approach that speed 
limit, how energy intensive would the training 
be, given that dissipative processes generally 
produce more entropy per task the faster that 
task is accomplished? 

Fundamental ionics arguments seem to call 
for high voltage and small length scales—that 
is, an extreme programming field approach 
(4-10). Transport of ions (such as H⁺) inside a
solid electrolyte (SE) layer and a mixed ionic- 
electronic conductor (MIEC) conductance 
channel layer, as well as charge-transfer re-
actions at the SE/MIEC interfaces, scale mono-
tonically and nonlinearly with the applied
voltage. Therefore the speed of change in the 
nonvolatile state of artificial neurons should 
increase with increasing electric field. How-
ever, there is a dielectric breakdown limit (11)
to solids, as well as a thermodynamic electro- 
chemical stability window for SEs. Therefore, 
the programming field should be raised as 
high as possible, but not so high as to per- 
manently damage the SE. In this work, we 
demonstrate that such an approach produced 
exceedingly fast nanoionic devices (5 ns), at 
least 10⁴ times as fast as biological synapses,
when the voltage was on the order of +10/-8.5 V,
yielding an extremely high electric
field of ~1 V/nm across the proton-conducting 
phosphosilicate glass (PSG) SE. Surprisingly, 
we also show that under appropriate operat- 
ing parameters, this nonlinear ionic-electronic 
device was robust and reversible, operating 
successfully over millions of cycles. Although 
the layout of our three-terminal device is 
analogous to that of a solid-state battery, it 
operated 10°° times as fast as conventional 
solid-state batteries. Finally, even when ap- 
proaching the ultimate speed limit of our 
devices, the heat generated per program- 
ming task was still favorable compared with 
that generated by a human synapse (~10 fJ 
per state). 
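
The scaling argument above reduces to a few ratios, sketched below as back-of-the-envelope arithmetic. The inputs are taken from the text (a ~10 V programming pulse, 5 ns pulses, a millisecond biological time scale, and ~10 μm versus ~10 nm length scales); the 10 nm electrolyte thickness is the value given in the Fig. 2 caption of this report. The function names are hypothetical.

```python
def electric_field_v_per_nm(voltage_v, thickness_nm):
    """Average field across a layer, ignoring interface voltage drops."""
    return voltage_v / thickness_nm


def ratio(large, small):
    """How many times larger the first quantity is than the second."""
    return large / small


# +10 V across a 10 nm PSG electrolyte (thickness from the Fig. 2 caption)
field = electric_field_v_per_nm(10.0, 10.0)

# Millisecond action potential versus a 5 ns programming pulse
speedup = ratio(1e-3, 5e-9)

# ~10 micrometre biological cell versus ~10 nm device
shrinkage = ratio(10e-6, 10e-9)

print(f"field ~ {field:.1f} V/nm")
print(f"speed-up ~ {speedup:.0e}x, shrinkage ~ {shrinkage:.0e}x")
```

Taking 1 ms as the biological reference gives a speed-up of about 2 × 10⁵, which is why a bound of several orders of magnitude is a conservative statement.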

Interest in engineering the ideal program- 
mable resistor for analog computing appli- 
cations has skyrocketed owing to increasing 
workloads of deep learning problems (12-14) 
and diminishing expectations for transistor 
performance improvements through size scal- 
ing (5). Highly optimized digital application- 
specific integrated circuits that run reduced 
precision arithmetic operations can still manage 
inference tasks (16); however, the resolution
requirements for training tasks do not allow 

Fig. 1. Nanosecond protonic programmable resistors. (A) Three-dimensional illustration of the protonic
programmable resistors studied in this work. Au (yellow), WO3 (green), PSG (magenta), and Pd (gray)
layers are indicated. As a result of an engineered sidewall, the Pd layer that overlaps with the channel
electrodes is isolated from the remainder of the gate electrode. G, gate; D, drain; S, source. (B) False-colored
top-view SEM image of a fabricated device with a 30-nm-by-60-nm channel. (C) Transmission electron
microscopy (TEM) cross-sectional image of a protonic programmable resistor that had previously
been extensively modulated in the ultrafast regime.

Fig. 2. Ultrafast and energy-efficient modulation characteristics of protonic programmable resistors.
(A) Modulation performance of a 50-nm-by-150-nm protonic device with 10-nm PSG, showing fast (5 ns per 
pulse), nearly linear, and symmetric characteristics. W, width; L, length. (B) Retention behavior of the protonic 
device for =100 s at different conductance levels over the full dynamic range. (C) Endurance characterization of the 


protonic device, displaying nondegrading modulation over 10° pulses conducted over 30 hours. 


sufficient bit reduction (17) and make beyond-
digital approaches imperative. The core idea 
behind analog training accelerators is to pro- 
cess information locally using physical device 
properties instead of conventional Boolean 
arithmetic—i.e., using Ohm’s and Kirchhoff’s 
laws for the matrix inner product (18) and
threshold-based updating for the outer pro- 
duct (19). However, the performance benefits 
attained by analog training processors are de- 
pendent on a set of very strict properties: The 
device must be fast, energy-efficient, nonvola- 
tile, and reversible, and it must show symmetric 
conductance modulation with many conduct-
ance states across a large dynamic range (20, 21).
Unfortunately, readily available memory tech- 
nologies [e.g., phase-changing (22, 23), fila- 
mentary (24), bulk-switching (25-28), and 
ferroelectric (29) resistive memories] do not 
completely satisfy these requirements. As a
result, demonstrations so far have required
additional control circuitry and serial opera- 
tions (22-24) to correct such imperfections, at 
the expense of substantially diminished space- 
time-energy performance. 

Because the core idea of neural network train- 
ing is to tune the state parameters (weights) 
through many small incremental modifica- 
tions (12, 13), it is reasonable that devices 
originally designed for information storage 
purposes (i.e., memory technologies) do not 
perform well in this information processing 
application. Devices for analog deep learn- 
ing (i.e., training) must be primarily optimized 
for state transition (i.e., modulation) rather 
than long-term state preservation (i.e., non- 
volatility). The retention of the weight pa- 
rameters is obviously important such that the 
information learned from previous inputs is 
not lost faster than the ongoing optimization. 


However, given that the purpose of the net- 
work is not to memorize correct labels for the 
training dataset but to find generalized fea- 
tures instead, retaining exact values for ex- 
tended durations of time loses relevance. The 
best example for this argument can be found 
in digital neural network training, for which it 
is a common practice to simulate weight decay
on values that could otherwise be retained for 
indefinite time (30). Because the properties that 
ensure long retention times contradict those that 
enable high-speed and low-energy modulation 
(31), we conclude that for training implementa- 
tions the focus should be on the latter two, trading 
retention down to the ~100-s-to-1000-s range 
for nanosecond and subpicojoule performance 
[see the supplementary materials (SM)]. 

Any memristive technology that aims to 
realize practical analog processors must 
contain a high density of devices built on top 
of standard complementary metal-oxide semi- 
conductor (CMOS) logic and therefore must 
rely on CMOS-compatible materials and in- 
volve back-end-of-line-compatible processes. 
Moreover, device operation should not depend 
on unconventional environmental condi- 
tions such as high temperature or humidity. 
With these concepts in mind, we developed 
an integration-friendly technology with in- 
organic solid materials that are native to 
conventional Si processing: WO3 as the active
channel material, nanoporous PSG as the 
protonic SE, and Pd as the hydrogen reservoir 
and controlling gate. The basic operating prin- 
ciple of the device relies on modulating the 
channel conductance via the electrochemically 
controlled intercalation of protons (chosen as 
the ion with the smallest radius and lightest 
mass) into WO3 (32, 33).

The rationale for choosing these materials is as follows: WO3 is known for
its conductance modulation and electrochromism upon ion intercalation
(25, 32-34) with well-established dynamics (32, 35). For analog deep learning
applications, the channel material must also have high base resistivity (19),
proton insertion rate, and sensitivity. We found that superior modulation
characteristics result from reactive sputtering of WO3 at room temperature,
followed by a 400°C annealing process that both oxidizes and crystallizes the
material (see SM for process details). Regarding the electrolyte, the material
must simultaneously show very high electronic resistivity and high proton
conductivity. In our previous work, we identified nanoporous PSG as an
outstanding SE (36-38) that displays both properties at room temperature
without the need for hydration (i.e., humidity) (33). Finally, Pd was chosen
as the gate metal and hydrogen reservoir (PdHx), owing to its capability to
take up and store hydrogen (39).

The three-terminal protonic programmable 
resistors studied in this work (Fig. 1A) were

Fig. 3. Voltage dependence of conductance modulation. (A) Channel
conductance change per pulse (ΔG_channel) for different pairs of pulse
amplitude (V_pulse) and duration (t_pulse) for a device with 5-nm PSG thickness.
Points represent averaged experimental data over 1000 identical pulses for a
given V_pulse-t_pulse pair, whereas the meshed surface represents the fitted
result. The apparent hopping distance (a) and power factor (β) values used to


fabricated through a series of electron-beam 
lithography processes. A self-aligned gate struc- 
ture was used to scale down device dimensions, 
avoiding mask alignment limitations. In a key 
design aspect of this process, the Pd layer was 
overlaid across a large region (Fig. 1B), which 
was then used as an etch mask for the PSG 
layer beneath. The height of the channel elec- 
trodes (Au) was calibrated such that the PSG 
layer could cover the sidewalls and the Pd layer 
that overlapped with the channel electrodes was 
disconnected from the gate electrode (Fig. 1C), 
avoiding the charging of those regions and un- 
necessary capacitance. This configuration was 
intended to maximize energy efficiency. The 
complete fabrication flow, additional metrology 
results, and yield details can be found in the SM. 

Before testing, the devices were exposed to 
forming gas (3% H2 in N2) at room temperature
for protonation of the Pd reservoir and then 
pumped down to vacuum. Future iterations of 
these protonic devices will be encapsulated to 
avoid environmental preconditioning steps. 
Figure 2A shows the channel conductance modulation of a 50-nm-by-150-nm
device with 10-nm-thick PSG for 1000 protonating voltage pulses (V+ = 10 V)
followed by 1000 deprotonating pulses (V− = −8.5 V). Between successive
pulses, the channel conductance was read under drain-source voltage
(V_DS) = 0.1 V and gate current (I_G) = 0 conditions and averaged for ~1 s.
The devices displayed nearly ideal characteristics in terms of (i) high
modulation speed, responding to 5-ns voltage pulses; (ii) nearly linear and
symmetric behavior for incremental and decremental changes; (iii) conductance
retention characteristics over durations longer than ~10^10 times the unit
pulse time (Fig. 2B); (iv) dynamic conductance range of 20×; (v) optimal base
resistance of 88 megohms for readout (19, 32); and (vi) preservation of these
ideal properties without any degradation over extended time and use (Fig. 2C).
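
The retention figure can be put in perspective with a short calculation. The sketch below is only illustrative arithmetic using numbers quoted in the text (5-ns pulses and retention in the ~100-s-to-1000-s range); the variable names are hypothetical and not taken from the original work.

```python
# Illustrative arithmetic only; values are those quoted in the text.
pulse_time_s = 5e-9            # unit programming pulse (5 ns)
retention_range_s = (100.0, 1000.0)   # targeted retention range (~100 s to ~1000 s)

# Ratio of retention time to unit pulse time at both ends of the range.
ratios = [t / pulse_time_s for t in retention_range_s]

for t, r in zip(retention_range_s, ratios):
    print(f"retention {t:6.0f} s -> {r:.1e} unit pulse times")
```

Even the lower end of the range corresponds to roughly 2 × 10^10 pulse durations, which is why retention of this order is treated as sufficient for training workloads.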

Moreover, the devices showed excellent en- 
ergy efficiency under this ultrafast operation, 
the gate current supplied during each pulse 
being too small to be precisely measured for the 
small devices. The energy consumption during 
the transients was estimated to be ~2.5 fJ per 
pulse (see SM), which is a technology-agnostic 
overhead related to charging and discharging 
the gate capacitance. On the other hand, the 
energy consumed in proton transfer while the 
5-ns voltage pulse was at its peak value was 
estimated to be ~15 aJ per pulse for the device 
whose performance is shown in Fig. 2. This 
value was based on a dc gate current mea- 
surement of ~30 nA at 10 V for a 750-nm-by- 
1000-nm device. This latter value is associated 
with the efficient shuttling of ions within the 
gate stack under the high electric field. Addi- 
tional data and calculations related to the energy 
consumption can be found in the SM. 
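
The ~15 aJ estimate can be reproduced with a back-of-the-envelope calculation. The sketch below assumes that the dc gate current scales with gate area (uniform current density), which is an assumption made here for illustration; the detailed calculation is in the SM. Variable names are hypothetical.

```python
# Values quoted in the text.
i_dc_large_a = 30e-9                   # dc gate current of the large device (A)
v_gate_v = 10.0                        # gate voltage (V)
large_area_m2 = 750e-9 * 1000e-9       # 750-nm-by-1000-nm device
small_area_m2 = 50e-9 * 150e-9         # 50-nm-by-150-nm device of Fig. 2
pulse_s = 5e-9                         # pulse duration (5 ns)

# Assumption: uniform current density, so current scales with area.
current_density_a_m2 = i_dc_large_a / large_area_m2
i_small_a = current_density_a_m2 * small_area_m2

# Energy delivered while the pulse sits at its peak value: E = I * V * t.
energy_j = i_small_a * v_gate_v * pulse_s

print(f"scaled gate current: {i_small_a * 1e9:.2f} nA")
print(f"proton-transfer energy per pulse: {energy_j * 1e18:.1f} aJ")
```

Under this scaling assumption the small device draws about 0.3 nA, and the energy per 5-ns pulse comes out at about 15 aJ, matching the value stated above.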

As a result, the combined material, process- 
ing, and performance benefits of the all-solid- 
state protonic devices demonstrated here exceed 
those of nonvolatile memory technologies. 

To explain these desirable modulation char- 
acteristics, we have developed a model for 
device operation that consists of two key 
parts: (i) proton transport in the PSG and (ii) 
proton-coupled electron-transfer reaction 
rates at the PSG/electrode interfaces. Both 
processes have qualitatively similar formal- 
isms and dependencies—the former is governed 
by hopping conduction in a disordered solid 
with random site energies (4-7, 10), whereas 
the latter is determined by the Butler-Volmer 


fit the data are 5.6 Å and 1.2, respectively. The pulse time dependence of
the protonation-deprotonation dynamics was empirically approximated as a
power law (t_pulse^β). (B) Estimation of the apparent hopping distance from
repeating the same experiment detailed in Fig. 3A for 25 devices selected
over four different chips with different PSG thicknesses. Error bars indicate the
SD of values acquired from different devices with the same PSG thickness.


charge-transfer kinetics (8, 9). Conventionally, 
a high activation energy for protons to be 
freed from their sites (e.g., Si-O-H or P-O-H) 
results in low ionic conductivity (40) and ul- 
timately limits the operation speed at room 
temperature. For amorphous SiO2, this activation
energy was reported to be 0.38 eV, which
is much larger than the thermal energy in 
ambient conditions (25.9 meV) (41). 

In the presence of a high electric field, the energy barrier to ion conduction
is lowered in the field direction, yielding an enhanced proton hopping current

$$J \propto \sinh\left(\frac{q\,a\,V_{\mathrm{PSG}}}{2\,k_{B}\,T\,d_{\mathrm{PSG}}}\right) \qquad (1)$$

where q is the electron charge, k_B is the Boltzmann constant, T is the
temperature, a is the apparent hopping distance, and d_PSG is the thickness of
the electrolyte (5). Owing to the high resistance of the PSG layer, all of the
pulse voltage is assumed to drop across the electrolyte (V_PSG ≈ V_pulse; see
SM for details). Figure 3A shows the experimentally observed conductance
change per pulse (ΔG_channel) as a function of pulse amplitude (V_pulse) and
pulse time (t_pulse) for a device with d_PSG = 5 nm. Over a range of electric
fields similar to that detailed in Fig. 2, the results shown in Fig. 3 closely
followed Eq. 1, as indicated by the lines. Furthermore, experiments for devices
with different d_PSG values allowed us to extract a hopping distance for the
PSG of 5.6 Å (Fig. 3B). This result is in good agreement with previously
reported values for amorphous silica glasses (41-43).
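
To get a feel for the size of the field effect, the sketch below evaluates the argument of a hyperbolic-sine hopping law, x = q·a·V/(2·k_B·T·d), using the hopping distance and thicknesses quoted in the text. The temperature of 300 K, the function names, and the use of sinh(x)/x as an "enhancement over the low-field limit" are assumptions made here for illustration, not quantities reported by the authors.

```python
import math

Q = 1.602176634e-19    # elementary charge (C)
K_B = 1.380649e-23     # Boltzmann constant (J/K)


def sinh_argument(v_psg, d_psg_m, a_m=5.6e-10, temp_k=300.0):
    """Dimensionless argument q*a*V / (2*k_B*T*d) of the sinh hopping law."""
    return Q * a_m * v_psg / (2.0 * K_B * temp_k * d_psg_m)


def field_enhancement(v_psg, d_psg_m, a_m=5.6e-10, temp_k=300.0):
    """sinh(x)/x: hopping current relative to a linear extrapolation
    of the low-field response (illustrative normalization)."""
    x = sinh_argument(v_psg, d_psg_m, a_m, temp_k)
    return math.sinh(x) / x


# 10-nm-thick PSG, as in the device of Fig. 2.
for v in (1.0, 5.0, 10.0):
    x = sinh_argument(v, 10e-9)
    print(f"V = {v:4.1f} V  x = {x:5.2f}  enhancement = {field_enhancement(v, 10e-9):.3g}")
```

At 10 V across 10 nm the argument is close to 11, so the sinh term is deep in its exponential regime and small changes in pulse amplitude translate into large changes in hopping current.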

In our case, the electric field (~1 V/nm) 
across the hopping distance (a ~ 0.5 nm) was 
so high that it might completely remove the

Fig. 4. Modulation dynamics for short and long pulse durations. (A) Channel
conductance modulation of the device whose performance is presented in Fig. 2 for
increasing pulse duration under the same programming voltages. Between pulses,
the channel conductance is read and averaged for ~1 s. (B) SEM image of the device


activation barrier within the PSG (0.4 eV). 
This effect resolves the bottleneck of low pro- 
ton conduction at room temperature, thus en- 
abling high-speed operation. Such an effect 
was predicted in simulations (44) but has not 
been previously observed experimentally, as 
the required conditions are beyond the break- 
down field or the electrochemical stability 
window of traditional electrolyte materials 
(45). Instead, PSG allows a high critical field 
(8 to 15 MV/cm) (46, 47) in addition to a 
moderate base proton conductivity (36-38), 
making it an ideal electrolyte choice for this 
application. Moreover, under the classical 
statistical mechanics scenario, reducing the 
migration energy below a few kgT boosts the 
likelihood of classical or quantum ballistic 
motion of protons. Indeed, in liquid water, 
proton motion is known to have a pronounced 
quantum character, with activationless quan- 
tum nuclear dynamics in some exchange 
events between water molecules (48). Thus, under 
such high electric fields, “quantum ballistic” 
transport of protons in solids may be realized. 
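
The comparison between the field and the activation barrier can be checked with simple arithmetic. The sketch below uses only the approximate values quoted in the text; variable names are hypothetical, and the calculation is a rough energy comparison rather than a transport model.

```python
# Approximate values quoted in the text.
field_v_per_nm = 1.0      # electric field across the electrolyte (~1 V/nm)
hop_nm = 0.5              # apparent hopping distance (~0.5 nm)
barrier_ev = 0.4          # activation barrier within the PSG (eV)
thermal_ev = 0.0259       # thermal energy at ambient conditions (eV)

# For a singly charged proton, the energy gained over one hop (in eV)
# equals the potential drop over that hop (in V).
drop_per_hop_ev = field_v_per_nm * hop_nm

print(f"energy drop per hop: {drop_per_hop_ev:.2f} eV")
print(f"barrier in units of thermal energy: {barrier_ev / thermal_ev:.1f}")
print(f"drop per hop exceeds barrier: {drop_per_hop_ev > barrier_ev}")
```

The potential energy gained over a single hop (about 0.5 eV) is comparable to or larger than the 0.4 eV barrier, whereas thermal energy alone is more than an order of magnitude smaller than the barrier.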

Unlike the conductive filament devices con- 
trolled by angstrom-scale short-range ion mo- 
tion at the tip of the filament (49), our protonic 
devices rely on long-range (~10 nm) and uni- 
form partitioning of protons across the electrolyte 
with the protons then behaving as dynamic 
dopants in the entire HxWO3 channel (32, 35).
Furthermore, conducting filament formation 
is assisted by Joule heating, with local temper- 
atures rising as high as ~500 K (50), whereas 
in our case ion transport is facilitated only by 
the electric field with negligible temperature 
increase (AT ~ 2 K; see SM). As a result, al- 
though very-short-range ion transport at the 
tip of the filament can readily occur in sub- 
nanosecond time scales, the dependence of
filament properties on the local microstructure
and microchemistry makes those devices
unpredictable and stochastic (2, 9, 49). On the 
contrary, devices studied in this work are mod- 
ulated by macroscopic chemical and electronic 
modifications, which result in distinctly control- 
lable and deterministic performance character- 
istics. These benefits make their acceleration 
to the nanosecond regime highly relevant. 

The ability to rapidly shuttle protons within 
bulk PSG at extreme speeds shifts the bottle- 
neck of proton transport to the interfaces. 
Although the hydrogen uptake rate of Pd is high (51) and the PdHx/PSG
interfacial reaction is likely also efficient, the same cannot be said for
proton insertion into polycrystalline WO3 (52), which shows up as a PSG/HxWO3
interfacial charge-transfer resistance. Because protons do not have high
diffusivity in HxWO3 (53), it is likely that the inserted species cannot
quickly vacate their sites near the interface, thus further reducing the
insertion rate in a self-limiting fashion. These factors result in an excess
of highly energetic protons at the WO3/PSG interface and may lead to H2 gas
formation and buildup at the interface (54).

The electrical signature and morphological 
consequences of long-pulse-time stressing of 
the devices are captured in Fig. 4. In Fig. 4A, 
the device previously characterized in Fig. 2 
was tested under the same V_pulse but for increasing
t_pulse. Above 40 ns, a cascading effect
was apparent, in which the conductance
change increased with each pulse, ultimately 
causing device failure for 90-ns pulses. A 
scanning electron microscopy (SEM) image 
of the device after this experiment is shown 
in Fig. 4B, whereas Fig. 4C shows an image of 
another device stopped at earlier stages (corre- 
sponding to the 40-ns pulse regime shown in

after the experiment shown in (A). (C) SEM image of another device captured during
early degradation. (D) Si-L2,3 EELS spectra of the PSG layer for tested (red and
orange) and fresh (blue) devices. The testing conditions of the device were identical
to those given in Fig. 2, operating under the ultrafast regime only.


Fig. 4A). These images show damage features
that we suggest are consistent with H2
gas evolution at the PSG/HxWO3 interface,
nanobubble formation, and stress buildup.

Most importantly, as shown in Fig. 2C, no 
degradation was observed under high-speed 
(5 ns) operation. To provide further evidence 
of good endurance characteristics in the ultra- 
fast regime, we performed electron energy-loss 
spectroscopy (EELS) for the SE in fresh and 
tested devices. Figure 4D shows that the Si L2,3
energy-loss spectra were similar in active and 
inactive regions across the PSG layer, which 
indicates that there was no stoichiometry 
change or that the change was too subtle to 
be detected. Considering that the widest 
electrochemical stability window for room 
temperature protonic electrolytes was pre- 
viously ~3.35 V (55), we attribute our de- 
vices’ ability to operate stably and reversibly at 
10 V/-8.5 V to the operation at nanosecond 
transients, where the PSG does not have to 
conform to the thermodynamic stability re- 
quirement for electrolytes (established after 
quasi-static wait times). Furthermore, these 
short time scales may simply be too fast for 
oxygen motion (slower than that of protons), 
which would otherwise cause degradation of 
material properties. We believe these are two 
key physical dynamics underlying the no- 
damage “protonic breakdown,” a transient 
phenomenon that we took advantage of in this 
novel ultrafast long-range transport regime. 

In summary, we explored the limits of all- 
solid-state ionics to answer the question of 
how much faster an electrochemical artificial 
synapse can operate relative to its biological 
counterpart. Under extreme electric field con- 
ditions, we demonstrated ultrafast modula- 
tion of nanoscale protonic programmable
resistors with outstanding energy efficiency.
Moreover, the devices exhibited many non- 
volatile channel conductance states with good 
retention across a large and ideal dynamic 
range, which could be programmed reversibly, 
repeatedly, and symmetrically. Operation of 
the devices at extreme electric fields across the 
stack without any material degradation is the 
key breakthrough to enable such performance. 
Under these conditions, the activation ener- 
gies that govern proton transport as well as 
charge-transfer reactions at the interface were 
substantially lowered, thus resulting in opera- 
tion speed enhanced by six orders of magni- 
tude at room temperature. Beyond artificial 
synapses, these findings open up possibilities 
in fields that require fast ion motion, such as 
microbatteries, artificial photosynthesis, and 
light-matter interactions.

CILIA AND FLAGELLA


In situ architecture of the ciliary base reveals the 
stepwise assembly of intraflagellar transport trains 


Hugo van den Hoek†, Nikolai Klena†, Mareike A. Jordan, Gonzalo Alvarez Viar,
Ricardo D. Righetto, Miroslava Schaffer, Philipp S. Erdmann, William Wan, Stefan Geimer,
Jürgen M. Plitzko, Wolfgang Baumeister, Gaia Pigino*, Virginie Hamel*,
Paul Guichard*, Benjamin D. Engel*


The cilium is an antenna-like organelle that performs numerous cellular functions, including motility, 
sensing, and signaling. The base of the cilium contains a selective barrier that regulates the entry 

of large intraflagellar transport (IFT) trains, which carry cargo proteins required for ciliary assembly and 
maintenance. However, the native architecture of the ciliary base and the process of IFT train assembly 
remain unresolved. In this work, we used in situ cryo—electron tomography to reveal native structures 
of the transition zone region and assembling IFT trains at the ciliary base in Chlamydomonas. We 
combined this direct cellular visualization with ultrastructure expansion microscopy to describe the 
front-to-back stepwise assembly of IFT trains: IFT-B forms the backbone, onto which bind IFT-A, 
dynein-1b, and finally kinesin-2 before entry into the cilium. 


Cilia (and closely related flagella) are evolutionarily
conserved eukaryotic organelles that extend from the cell surface
and are involved in a wide variety of
functions, including cell motility, fluid
flow generation, sensing, and signaling (1, 2).
The cilium consists of an axoneme of micro- 
tubule doublets (MTDs), which extend from 
the microtubule triplets (MTTs) of the centri- 
ole (also called a basal body) and are sheathed 
in a ciliary membrane. At the base of the 
cilium, a specialized region known as the tran- 
sition zone (TZ) gates entry and exit of both 
membrane-bound and soluble ciliary proteins 
(3-5). Among its many gating functions, the 
TZ is believed to regulate entry of intraflagellar
transport (IFT), bidirectional traffic
between the ciliary base and tip driven
by kinesin and dynein motors (6, 7). IFT is


Affiliations: Biozentrum, University of Basel, 4056 Basel, Switzerland;
Helmholtz Pioneer Campus, Helmholtz Munich, 85764 Neuherberg, Germany;
Department of Molecular Structural Biology, Max Planck Institute of
Biochemistry, 82152 Martinsried, Germany; Department of Molecular and Cellular
Biology, Section of Biology, University of Geneva, 1211 Geneva, Switzerland;
Human Technopole, 20157 Milan, Italy; Max Planck Institute of Molecular Cell
Biology and Genetics, 01307 Dresden, Germany; Department of Biochemistry and
Center for Structural Biology, Vanderbilt University, Nashville, TN 37232, USA;
Cell Biology and Electron Microscopy, University of Bayreuth, 95447 Bayreuth,
Germany.
*Corresponding author. Email: gaia.pigino@fht.org (G.P.);
virginie.hamel@unige.ch (V.H.); paul.guichard@unige.ch (P.G.);
ben.engel@unibas.ch (B.D.E.)
†These authors contributed equally to this work.


29 JULY 2022 * VOL 377 ISSUE 6605 543 


RESEARCH | REPORTS 




Fig. 1. Cryo-ET structure of the ciliary transition zone within native 
Chlamydomonas cells. (A) 2D slice through a cryo-electron tomogram, 
showing a TZ in longitudinal cross section. Stellate fibers, purple; Y-links, 
turquoise; MTD helical sleeve, dark blue; stellate plate, white asterisk (see fig. 
S2); and IFT, yellow (see Fig. 2). Scale bar: 100 nm. (B) Position along the TZ 
occupied by different structures. Distances are measured relative to the stellate 
plate (see fig. S6), defined as the “0 nm” origin point. SOFA, site of flagellar
autotomy, where the cilium is cleaved (see figs. S6 and S7). CP, central pair.
Error bars indicate standard deviation. (C to E) Composite model of the TZ,
combining averages of the stellate (purple), Y-links (turquoise), MTD sleeve
(dark blue), and associated MTD (light blue), with schematic renderings of MTTs
(dark gray), MTDs (light gray), and the CP (white). (C) Longitudinal section
view of the complete composite model, assembled according to the measured
lengths and positions of each component, with 21 Y-link repeats, 21 stellate
repeats (7 proximal of the plate, 14 distal), and 5 MTD sleeve repeats. The model 
shows straight MTTs and MTDs, but, as seen in (A) and quantified in (16), the 
centriole is actually a slightly convex barrel. [(D) and (E)] Cross-sectional views 
through the indicated regions of the composite model, showing (D) MTDs 
encased in the helical sleeve, with the CP in the middle, and (E) the nine-pointed 
stellate cylinder attached to MTDs decorated with Y-links. (F and G) Side and 
cross-sectional views of a single MTD attached to (F) stellate fibers and Y-links 
and (G) the helical sleeve. Protofilaments of the A- and B-microtubules are 
numbered. (H) Unrolled composite model, viewed from the center of the TZ, 
looking outward toward the inner wall of the stellate cylinder. One continuous 
helical density of the six-start stellate helix is marked in pink (pitch: 49.2 nm). 
MTDs are numbered. See also movie S1. 


required for assembly and maintenance of 
the cilium, as well as for mediating many of 
its signaling functions (8). A pool of IFT pro- 
teins and motors resides at the ciliary base for 
several seconds before entry into the cilium 
(9, 10). However, it remains to be characterized 
how these proteins assemble into the elaborate 
multi-megadalton IFT trains observed running
along the ciliary axoneme (11). In this study,
we combined in situ cryo-electron tomography
(cryo-ET) (12) with ultrastructure expansion
microscopy (U-ExM) (13) of the green alga
Chlamydomonas reinhardtii, a classic model 
for cilia research, to reveal native TZ structures 
and stepwise assembly of IFT trains at the 
ciliary base. 


Vitrified Chlamydomonas cells were thinned 
with a focused ion beam (FIB) (14) and then imaged
by cryo-ET, revealing the ciliary base inside the 
native cellular environment (figs. S1 and S2). 
Focusing on the TZ, we observed several dif- 
ferent structures attached to the MTDs (Fig. 1, 
A and B), which we resolved in molecular de- 
tail by subtomogram averaging (Fig. 1, C to H,

Fig. 2. Cryo-ET structures of assembling IFT trains attached to the transition
zone. (A) Composite structures of assembling IFT trains (IFT-B, yellow; IFT-A,
orange; and dynein-1b, red) engaging the ciliary TZ (colored as in Fig. 1). A
schematic membrane has been added (gray double lines). Subtomogram averages
have been mapped back into their positions inside a representative tomogram.
(Inset) Rotated top view of the rightmost IFT train from (A). (B and C) 2D slices
through tomograms, showing assembling IFT trains, and (B’ and C’) overlaid 3D 
train structures, with their front ends attached to the TZ. (Inset) Enlarged view 


and figs. S3 and S4). The proximal ~180 nm of 
the TZ is occupied by peripheral Y-links (Fig. 1, 
turquoise) and luminal stellate fibers (Fig. 1, 
purple, and fig. S2), which resemble a nine- 
pointed star in cross section (15). Our structure 
reveals that the stellates form a helical cylinder 
with a six-start helix and a pitch of 49.2 nm 
(Fig. 1H). Notably, this matches the pitch of 
the inner scaffold, a three-start helical cylin- 
der in the lumen of the centriole (16). The 
nine points of the stellate star bind MTDs at 
protofilament A3, with a longitudinal periodicity
of ~8.1 nm along each MTD (Fig. 1F and
fig. S5), which matches the 8.2 nm length of 
one tubulin dimer. 
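
The link between the helical pitch and the longitudinal repeat follows from the geometry of a multi-start helix: the axial spacing between neighboring strands is the pitch divided by the number of starts. The short sketch below checks this with the values quoted in the text; variable names are hypothetical.

```python
# Values quoted in the text.
pitch_nm = 49.2            # helical pitch shared by stellate cylinder and inner scaffold
stellate_starts = 6        # stellate cylinder: six-start helix
scaffold_starts = 3        # inner scaffold: three-start helix
tubulin_dimer_nm = 8.2     # length of one tubulin dimer

# Axial spacing between adjacent strands of an n-start helix is pitch / n.
stellate_spacing_nm = pitch_nm / stellate_starts
scaffold_spacing_nm = pitch_nm / scaffold_starts

print(f"stellate strand spacing: {stellate_spacing_nm:.1f} nm")
print(f"inner scaffold strand spacing: {scaffold_spacing_nm:.1f} nm")
print(f"stellate spacing in tubulin dimers: {stellate_spacing_nm / tubulin_dimer_nm:.2f}")
```

A six-start helix with a 49.2-nm pitch gives a strand spacing of 8.2 nm, i.e., one tubulin dimer, whereas the three-start inner scaffold with the same pitch has a spacing of two dimers.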

Y-links are present in many species. They 
connect MTDs to the ciliary membrane and 
help gate transport into and out of the cilium 
(17-20). Our structure reveals a broad density 
attaching the Y-links to the MTD, spanning 
protofilaments A9, A10, and B1 to B4, with a
longitudinal repeat of ~8.3 nm (Fig. 1F and
fig. S5). However, the outer densities of the 
Y-links that connect to the ciliary membrane

showing the double AAA+ adenosine triphosphatase rings of the dynein-1b motors.
Arrowhead colors correspond to (A): MTT/MTD, gray, and membrane, white.
Scale bars: 100 nm in (B) and (C), 20 nm in the inset. (D) Different views of the
cytosolic IFT train composite subtomogram average. Colored as in (A). In the
lower right, IFT-A was removed to show details on IFT-B. Red asterisks indicate
the extra IFT-B density, which was not observed on mature trains inside cilia (see
fig. S11). Assembling trains were only found attached to the TZ, not in the
surrounding cytosol farther from the ciliary base (see fig. S1). See also movie S2.


were not resolved, likely owing to flexibility 
of these filamentous structures. We observed 
that the Y-links extend along the MTDs prox- 
imal of the ciliary membrane (Fig. 1A). This 
suggests that connection to the membrane is 
not a prerequisite for MTD Y-link decoration. 

Immediately distal to the stellates and Y-links, 
we found a distinct helical density completely 
decorating the MTDs (Fig. 1, dark blue). This 
helical “sleeve” spans 76 + 6 nm along the 
MTDs, with a periodicity of ~16.4 nm (fig. S5). 
Cryo-ET and U-ExM immediately after ciliary

Fig. 3. IFT trains undergo stepwise assembly from front to back. (A and
B) Assembly states of IFT trains relative to structures of the TZ [shown at the
bottom of (B)]. Distances are measured by the number of IFT-B subunits relative
to the onset of the Y-links (defined as the “0 IFT-B” origin point). (A) Position
and train assembly state, measured for each IFT-B subunit of each assembling 
IFT train. Assembly states [color code in (A) applies to all panels] were 
determined on the basis of occupancy of IFT-A and/or dynein-1b on the IFT-B 
backbone. Black squares indicate train attachment points, that is, the most 
proximal IFT-B subunit that is close enough to the MTD (<20 nm) to be 
considered bound to the microtubule. The three trains marked with black 
asterisks lack a complete “middle” (no dynein is bound) and thus were omitted 
from analysis in (D) because “back-end B + A” could not be split from “front-end 
B + A.” Of the 70 trains in our dataset, 30 had front ends cropped by the FIB
milling and therefore were not quantified, as their starting point relative to the TZ
could not be determined. Eight more trains had cropped back ends (indicated
with a red “x”) but were included in the analysis. (B) Cumulative plot of all
assembling IFT trains in (A), with positions in relation to the TZ (shown at 
bottom). Colored curves above the line show the summed abundance of each IFT 
assembly state. The black curve below the line shows the distribution of MTD 
attachment points, summed from the black squares in (A). (C) Distribution 

of IFT train lengths and completeness of assembly [colored as in (A)] for mature 
anterograde trains found in the cilium (11). (D) Comparison of the abundance 
of assembly states at the front and back ends of assembling (“A”) versus mature 
(“M”) trains [i.e., trains in (A) versus trains in (C)]. Box, median and 25 to
75% percentiles; X, mean; whiskers, 1.5x interquartile range; points, outliers. 
Statistical significance assessed by unpaired t test (****P < 0.0001; **P = 
0.002; *P = 0.03; ns, not significant, P > 0.05). Because the backs of some 
assembling trains in this analysis were clipped by the FIB milling [“x” in (A)], the 
“IFT-B only” region of assembling trains should be slightly more pronounced 
than the quantification shown here. 


shedding (figs. S6 and S7) revealed that the 
MTD sleeve overlaps with the ciliary cleavage 
site, also known as the site of flagellar autotomy
(SOFA) (21) (Fig. 1B). Consistent with this
finding, the MTD sleeve was absent from the 
severed ends of MTDs both on the TZ (fig. S6, 
C and D) and isolated cilia (22). We therefore 
hypothesize that this structure might help reg- 
ulate axoneme severing and is lost during the 
process. Because the sleeve density covers every 
MTD protofilament, it should sterically hinder 
the attachment of molecular motors and thus
may also play a role in regulating IFT entry or
exit. The sleeve does not appear to be present on
the TZ of motile cilia from bovine trachea (23).

It has been known for decades that a pool of 
IFT proteins is localized near the base of the 
cilium (24, 25), but the structural organization 
of this basal pool has remained a mystery. In 
our tomograms, we observed filamentous strings 
of particles with one end attached to the TZ and 
the other end splayed into the cytosol, passing 
between the transition fibers (Fig. 2A and fig. S8).
These strings consisted of three layers, each with
a different shape and periodicity (Fig. 2, B and C,
and fig. S9). We performed subtomogram averaging
of particles along each layer and combined
them to produce a composite molecular structure
(Fig. 2D and fig. S10). Comparison to a
previously published cryo-ET structure of mature
anterograde IFT trains within the cilium (11)
confirmed that the cytosolic strings are indeed
IFT trains. Our observed cytosolic train structure
was similar to the anterograde train structure
(fig. S11), enabling us to assign densities to
IFT-B complexes, IFT-A complexes, and dynein-1b
motors, which have longitudinal periodicities
of ~6, ~11, and ~18 nm, respectively (fig. S12).
As was shown for anterograde IFT, we observed

Fig. 4. Assembly of anterograde IFT trains examined by U-ExM. (A to
E) Longitudinal views of assembling IFT trains at the ciliary base, visualized by
antibody staining and U-ExM. (A) KAP-GFP (cyan). Connected arrows mark
the approximate region shown in (I) and (J). (B) D1bLIC-GFP (green). (C) IFT46-
YFP (yellow). White arrows indicate approximate positions of top views in
(F) and (G). (D) IFT72 (yellow). (E) IFT139 (gray). Tubulin staining is magenta.
(F and G) Top views displaying ninefold occupancy of IFT trains at (F) the distal
TZ (confirmed by the presence of the central pair) and (G) the proximal TZ
(see fig. S18). (H) Measured lengths of 3D traces for each IFT component on
the assembling trains (see fig. S15E). Statistical significance assessed by
unpaired t test (****P < 0.0001; ***P = 0.0009; ns, not significant, P > 0.05).

that these cytosolic trains were attached to the 
MTD B-tubule (fig. S13D) (26) and that dynein-1b 
was loaded onto the trains as a cargo in an 
autoinhibited state (17, 27). Thus, much of the 
IFT pool at the ciliary base consists of TZ- 
tethered anterograde IFT trains that are under- 
going assembly before entry into the cilium. We 
did not see retrograde trains at the ciliary base, 
which implies that they rapidly disassemble, 
perhaps even before exiting the TZ. 

We observed two notable differences be- 


tween assembling and mature IFT trains. First, 
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assembling trains are flexible and display re- 
gions of high curvature (Fig. 2, A to C, and 
fig. S9), whereas trains in the cilium have an 
extended straight conformation (fig. $12) (1D), 
which is likely maintained by interactions with 
cargos and the ciliary membrane. Second, as- 
sembling IFT-B has a prominent extra density 
on the side opposite IFT-A (Fig. 2D and fig. S11; 
red asterisks). In regions where IFT trains are 
bound to the TZ, this extra density is positioned 
adjacent to the MTD. The density is not part 


U-ExM length trends are consistent with cryo-ET (Fig. 3), except that 
traces were longer than IFT139. One possible explanation could be that 
added to trains after the rest of IFT-A, but these short traces might a 
influenced by a weaker anti-IFT139 antibody (see methods). (I and J) 
staining of IFT trains at the TZ for 
Arrowheads indicate the back ends 
fig. S17). (K) Consensus model of IFT train assembly from cryo-ET and U-ExM 
measurements. New IFT-B subunits are added to the back of the train, 


D1bLIC 
IFT139 is 
so be 
Dual 

) KAP + IFT139 and (J) IFT46 + IFT139. 

of each IFT component on the train (see 


forming a 
h IFT-A, dynein-lb, and finally kinesin-2 close to the 
0.24 wm in (A) to (F), (I) and (J), 0.36 um in (G). 
to (G), 100 nm in (I) and (J). 


present on IFT-B in the sections of trains that 
dangle into the cytosol (fig. S13). 

In situ cryo-ET combines native molecular 
structures with precise cellular localization, 
enabling us to plot the spatial relationship 
between assembling IFT trains and the TZ 
(Fig. 3, A and B). The front segments of IFT 
trains attach to MTDs (but never to the MTTs 
of the centriole) and then continue in the dis- 
tal direction along these microtubule tracks, 
while the back regions curve away from the TZ 


of the MTD structure itself, because it is also 


into the cytosol. The front ends of IFT trains 
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were frequently observed at the Y-links, but 
only very rarely further distal at the MTD 
sleeve (fig. S9). By mapping the IFT-B, IFT-A, 
and dynein-1b structures along the IFT trains 
(as shown in Fig. 2A), we visualized the spa- 
tially defined order of train assembly. Anter- 
ograde trains always contain IFT-B, which 
forms a backbone scaffold upon which other 
components are attached. This train architec- 
ture is consistent with previous findings that 
IFT-B recruits IFT-A into cilia (28, 29). We 
plotted the train assembly state relative to 
each IFT-B subunit, and although we observed 
variability between individual trains (Fig. 3A), 
the cumulative plot shows a clear distribution 
of IFT train components (Fig. 3B). The back seg- 
ments of trains were in a less assembled state, 
with IFT-B extending the farthest, followed by 
IFT-A and then dynein-1b (we did note some 
assembling trains where dynein-1b extended farther 
than IFT-A). No gaps were observed in the 
regions occupied by IFT-A and dynein-1b, sug- 
gesting that these components linearly oligomer- 
ize from front to back on the IFT-B scaffold. 

Next, we compared the assembling cytosolic 
trains to mature anterograde trains within the 
cilium (Fig. 3, C and D). Mature trains ex- 
hibited a continuum of lengths from 223 to 
558 nm, comprising 36 to 90 IFT-B subunits 
with an average of 62 ± 16 IFT-B. In comparison, 
the longest assembling train we observed 
contained 88 IFT-B subunits, and the 
average length was 44 ± 17 IFT-B. Thus, just 
one IFT train is assembled at a time per MTD. 
Cytosolic and ciliary IFT trains have similarly 
incomplete front ends, whereas the incomplete 
segment at the back of cytosolic trains is 
longer, as these regions are still undergoing 
assembly (Fig. 3D). Perhaps as IFT trains enter 
the cilium, part of the incompletely assembled 
back end is left behind at the TZ to nucleate the 
front of the next train (Fig. 4K). Because assembly 
of IFT-A and dynein-1b proceeds front 
to back, the front end of the new train would 
remain incomplete, as we see in mature anter- 
ograde trains inside the cilium (Fig. 3C). 
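
As a quick consistency check on the numbers above, the reported length range of mature trains can be divided by the reported IFT-B subunit counts and compared with the ~6 nm IFT-B periodicity quoted earlier. The sketch below is only illustrative arithmetic; the variable and function names are our own, and it assumes the shortest train pairs with the lowest subunit count and the longest with the highest.

```python
# Reported ranges for mature anterograde trains (values taken from the text).
length_nm = (223.0, 558.0)   # shortest and longest train length, in nm
subunits = (36, 90)          # corresponding numbers of IFT-B subunits


def spacing_nm(length, count):
    """Average longitudinal spacing per IFT-B subunit, in nm."""
    return length / count


# Assumption: shortest length pairs with fewest subunits, longest with most.
spacings = [spacing_nm(l, n) for l, n in zip(length_nm, subunits)]

for l, n, s in zip(length_nm, subunits, spacings):
    print(f"{l:.0f} nm / {n} subunits = {s:.2f} nm per IFT-B subunit")
```

Both ends of the range give roughly 6.2 nm per subunit, which agrees with the ~6 nm periodicity assigned to IFT-B.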

The kinesin-2 anterograde IFT motor is dif- 
ficult to identify by cryo-ET owing to its rela- 
tively small size and flexible structure. We thus 
turned to U-ExM to examine the occupancy of 
kinesin-2 on assembling trains. We imaged 
components of kinesin-2, dynein-1b, IFT-B 
(subcomplexes B1 and B2), and IFT-A (Fig. 4, 
A to G, and fig. S14). Consistent with our cryo-ET 
data, all of these proteins formed filamen- 
tous strings at the ciliary base, with their 
distal ends in close proximity to the TZ (marked 
by centrin staining and the lack of glutamylated 
tubulin) (figs. S15 and S16) and their prox- 
imal ends curving into the cytoplasm. Using 
semiautomated tracing, we plotted the three- 
dimensional (3D) trajectories of each protein 
(fig. S15E). IFT-B traces were longer than traces 
of IFT-A and dynein-1b (Fig. 4H). Kinesin-2 
traces were the shortest and were localized at 
the front of IFT trains close to the TZ (fig. 
S16). Double-labeling of individual IFT trains 
confirmed that IFT-A extends farther than 
kinesin-2 toward the back of the train, and 
IFT-B extends farther than IFT-A (Fig. 4, I 
and J, and fig. S17). 

We observed ninefold occupancy of IFT 
around the majority of TZs (Fig. 4, F and G, 
and fig. S18). Measured by U-ExM, cryo-ExM, 
and cryo-ET, 93 to 96% of MTDs were bound 
by an assembling IFT train. Given that trains 
enter the cilium with a frequency of ~1 train/s 
(28, 30), we predict that trains linger at the 
ciliary base for an average of 9 s during 
assembly. Consistent with this hypothesis, 
photobleaching analysis of the Chlamydomonas 
basal pool showed that several green fluorescent 
protein (GFP)-tagged IFT proteins require ~9 s 
to recover peak fluorescence (9). Regardless, 
ciliary entry appears to be stochastic rather than 
sequential, with an avalanche-like relationship 
between lag time and IFT train size (31). 
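
The ~9 s estimate can be read as an application of Little's law (mean residence time = mean number of items in the system / throughput). The sketch below reproduces that arithmetic under the assumptions that the basal pool is at steady state, that each of the nine MTDs holds at most one assembling train, and that entry occurs at ~1 train/s. The constant and function names are ours, not the authors'.

```python
N_MTD = 9                  # microtubule doublets per transition zone
ENTRY_RATE = 1.0           # trains entering the cilium per second (approximate)
OCCUPANCY = (0.93, 0.96)   # reported fraction of MTDs bound by an assembling train


def mean_dwell_time_s(n_sites, occupancy, rate_per_s):
    """Little's law: mean residence time = mean number in system / throughput."""
    return n_sites * occupancy / rate_per_s


# Assumption: steady state, at most one assembling train per MTD.
dwell = [mean_dwell_time_s(N_MTD, f, ENTRY_RATE) for f in OCCUPANCY]

for f, t in zip(OCCUPANCY, dwell):
    print(f"occupancy {f:.0%}: mean dwell time ~ {t:.2f} s")
```

With the reported occupancy, the estimate falls between about 8.4 and 8.6 s, which rounds to the ~9 s quoted in the text.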

Our combined cryo-ET and U-ExM mea- 
surements support a spatially defined sequence 
of IFT train assembly: IFT-B first forms the 
backbone, which scaffolds the front-to-back 
attachment of IFT-A, dynein-1b, and finally 
kinesin-2 close to the TZ (Fig. 4K). A major 
remaining question is how IFT entry into the 
cilium is regulated. What prevents the front 
end of the train from entering the cilium as 
assembly proceeds toward the incomplete back 
of the train? Possible mechanisms may include 
controlled loading of kinesin-2 by factors at the 
TZ, the sleeve serving as a barrier to prevent 
entry of small trains, or the extra IFT-B-associated 
density acting as a molecular brake until train 
assembly is completed. 


REFERENCES AND NOTES 

1. M. V. Nachury, D. U. Mick, Nat. Rev. Mol. Cell Biol. 20, 389-405 (2019). 
2. D. R. Mitchell, Cold Spring Harb. Perspect. Biol. 9, a028290 (2017). 
3. Q. Hu et al., Science 329, 436-439 (2010). 
4. H. L. Kee et al., Nat. Cell Biol. 14, 431-437 (2012). 
5. D. K. Breslow, E. F. Koslover, F. Seydel, A. J. Spakowitz, M. V. Nachury, J. Cell Biol. 203, 129-147 (2013). 
6. G. Pigino, Curr. Biol. 31, R530-R536 (2021). 
7. K. G. Kozminski, K. A. Johnson, P. Forscher, J. L. Rosenbaum, Proc. Natl. Acad. Sci. U.S.A. 90, 5519-5523 (1993). 
8. H. Ishikawa, W. F. Marshall, Nat. Rev. Mol. Cell Biol. 12, 222-234 (2011). 
9. J. L. Wingfield et al., eLife 6, e26609 (2017). 
10. J. V. K. Hibbard, N. Vazquez, R. Satija, J. B. Wallingford, Mol. Biol. Cell 32, 1171-1180 (2021). 
11. M. A. Jordan, D. R. Diener, L. Stepanek, G. Pigino, Nat. Cell Biol. 20, 1250-1255 (2018). 
12. S. Asano, B. D. Engel, W. Baumeister, J. Mol. Biol. 428, 332-343 (2016). 
13. D. Gambarotto et al., Nat. Methods 16, 71-74 (2019). 
14. M. Schaffer et al., J. Struct. Biol. 197, 73-82 (2017). 
15. S. Geimer, M. Melkonian, J. Cell Sci. 117, 2663-2674 (2004). 
16. M. Le Guennec et al., Sci. Adv. 6, eaaz4137 (2020). 
17. B. Craige et al., J. Cell Biol. 190, 927-940 (2010). 
18. C. L. Williams et al., J. Cell Biol. 192, 1023-1041 (2011). 
19. F. R. Garcia-Gonzalo, J. F. Reiter, Cold Spring Harb. Perspect. Biol. 9, a028134 (2017). 
20. A. R. G. De-Castro et al., J. Cell Biol. 221, e202010178 (2022). 
21. L. M. Quarmby, in The Chlamydomonas Sourcebook (Second Edition), Volume 3, E. H. Harris, D. B. Stern, G. B. Witman, Eds. (Academic Press, 2009), pp. 43-69. 
22. K. H. Bui, T. Yagi, R. Yamamoto, R. Kamiya, T. Ishikawa, J. Cell Biol. 198, 913-925 (2012). 
23. G. A. Greenan, R. D. Vale, D. A. Agard, J. Cell Biol. 219, e201907060 (2020). 
24. D. G. Cole et al., J. Cell Biol. 141, 993-1008 (1998). 
25. Y. Hou et al., J. Cell Biol. 176, 653-665 (2007). 
26. L. Stepanek, G. Pigino, Science 352, 721-724 (2016). 
27. K. Toropova et al., Nat. Struct. Mol. Biol. 26, 823-829 (2019). 
28. C. Iomini, V. Babaev-Khaimov, M. Sassaroli, G. Piperno, J. Cell Biol. 153, 13-24 (2001). 
29. J. M. Brown, D. A. Cochran, B. Craige, T. Kubo, G. B. Witman, Curr. Biol. 25, 1583-1593 (2015). 
30. B. D. Engel, W. B. Ludington, W. F. Marshall, J. Cell Biol. 187, 81-89 (2009). 
31. W. B. Ludington, K. A. Wemmer, K. F. Lechtreck, G. B. Witman, W. F. Marshall, Proc. Natl. Acad. Sci. U.S.A. 110, 3925-3930 (2013). 

ACKNOWLEDGMENTS 


We thank T. Ishikawa, A. Noga, D. Diener, and K. Lechtreck for 
helpful discussions that influenced the manuscript. G.P. and G.A.V. 
thank T. Furstenhaupt from the MPI-CBG EM facility for instrument 
support. Calculations were performed at the Max Planck Institute for 
Biochemistry computing cluster in Martinsried, Germany, and at 
the sciCORE (https://scicore.unibas.ch/) scientific computing 
center at the University of Basel. Funding: This work was funded 
by the European Research Council (ERC) under the European 
Union's Horizon 2020 research and innovation program (grant 
819826) to G.P., the Swiss National Science Foundation (SNSF) 
Grant PP00P3_187198 to P.G., the ERC ACCENT Starting Grant 
715289 to P.G., a European Molecular Biology Organization 
(EMBO) Fellowship (grant ALTF 537-2021) to N.K., and an 
Alexander von Humboldt Foundation Fellowship to R.D.R. 
(R.D.R. also acknowledges a nonstipendiary fellowship from 
EMBO). Additional funding for personnel and instrumentation 
was provided by Helmholtz Munich, the Max Planck Society, 
and the Biozentrum of the University of Basel. Author 
contributions: M.S., P.S.E., and B.D.E. acquired cryo-ET data of 
FIB-milled C. reinhardtii cells. G.A.V. prepared samples and 
acquired cryo-ET data of isolated basal bodies. M.A.J. prepared 
samples and acquired cryo-ET data of intact cilia. H.v.d.H. 
analyzed all the cryo-ET data, with help from M.A.J., R.D.R., and 
W.W. U-ExM data acquisition and analysis was performed by N.K., 
with guidance from P.G. and V.H. Conventional electron 
microscopy was performed by S.G. Additional mentorship was 
provided by W.B. and J.M.P., along with access to instrumentation. 
G.P., V.H., P.G., and B.D.E. initiated and supervised the study. 
H.v.d.H., N.K., and B.D.E. wrote the paper, with input from all 
authors. Competing interests: W.B. is on the life science advisory 
board of Thermo Fisher. Data and materials availability: Cellular 
tomograms, subtomogram averages, and composite maps have 
been deposited at the Electron Microscopy Data Bank (EMD-15252 
to EMD-15263, see table S2). Raw cryo-ET data are available 
from the Electron Microscopy Public Image Archive (EMPIAR-11078). 
All data needed to evaluate the conclusions of the study are present 
in the paper and the supplementary materials. Correspondence 
and requests for materials should be addressed to B.D.E. 
(ben.engel@unibas.ch). License information: Copyright © 2022 
the authors, some rights reserved; exclusive licensee American 
Association for the Advancement of Science. No claim to original US 
government works. https://www.science.org/about/science-licenses-journal-article-reuse 


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.abm6704 
Materials and Methods 

Figs. S1 to S18 

Tables S1 and S2 

References (32-64) 

MDAR Reproducibility Checklist 

Movies S1 and S2 


Submitted 3 October 2021; accepted 27 June 2022 
10.1126/science.abm6704 



WORKING LIFE 


By Ahmed Elbassiouny 


Unequal undertakings 


When I started graduate school, I knew it was not going to be an easy ride—not only because of 
the work, but also financially. My graduate stipend is thousands of dollars below the federal 
poverty line. This is perhaps adequate when graduate students have family support to fall 
back on, or at least minimal expenses. But for me, and for many others, this couldn’t be fur- 
ther from the truth. I left behind my family, friends, and support system in Egypt when I was 
17 to pursue higher education in Canada and chase the dream of working alongside world- 
renowned scientists. I also have a chronic disease that knows no break. To stay alive, I rely on daily medi- 
cations and supplies that are not fully covered by the national health care system. My graduate stipend 
is barely enough to pay for basic living costs, and my medical expenses are nearly impossible to afford. 


To make ends meet, I have taken 
on extra hours working as a teach- 
ing assistant. I’m lucky this is even 
an option for me; in many depart- 
ments, graduate students are not 
allowed to hold extra teaching ap- 
pointments as it “distracts” from 
thesis research. The prevailing 
assumption seems to be that all 
students can and should dedicate 
every minute to their thesis, and 
failure to do so is a sign of lack of 
devotion or drive. 

But my teaching has nothing to 
do with a lack of passion for my 
field of study. I love my research. 
Sometimes, when I go home after 
setting up an exciting experiment, 
I find myself counting the hours 
until I can return to the lab and 
see the results the next morning. 
What distracts me from my science 
is not my teaching per se, but my 
financial worries. 

Even though my department allows me to take on extra 
teaching, my peers and advisers often judge me for doing 
it. In one particularly memorable instance, a junior lab 
mate asked about my plans for the summer, and I men- 
tioned I was going to be teaching in addition to present- 
ing at a conference, conducting experiments, and doing 
everything else involved in pursuing a Ph.D. “Again?!” he 
exclaimed. “You are teaching way too much! You should 
spend the summer focusing on your research and increas- 
ing your productivity.” He had no idea what I was juggling. 

I don’t tell my colleagues about my health condition be- 
cause I don’t want it to define me or my scientific capa- 
bilities. I thought working hard to hide my struggles and 
meet expectations was the best way forward. At one point, 
I was teaching three times the expected number of hours 
while also spending late nights 
and weekends on my thesis work. I 
put on a brave face, but in reality I 
was scared, lonely, and exhausted. 

Eventually my unhealthy ham- 
ster wheel got to me. My health 
suffered significantly and I devel- 
oped complications. I spent the 
latter half of 2019 and the first 
years of the pandemic running 
from one specialist to another, at 
one point requiring biweekly inva- 
sive eye procedures that left me in 
debilitating pain. Still, I continued 
to work on my thesis, returning to 
the lab as soon as my blurry vision 
returned to normal. I was trying to 
devote everything to my academic 
journey, like my peers who have 
easier circumstances—despite my 
health condition, greater expenses, 
and lack of family support. 

As I approach the final stretch 
of my Ph.D., I look forward to 
leaving academia for a job where my efforts are appreci- 
ated and my well-being respected. I am grateful for the 
scientific training I have received, but the cost has been 
too high. To truly foster inclusivity in academia, we must 
understand that academics have outside commitments 
including taking care of children, parents, and our own 
health. We should be helped through those challenges— 
for example, with less humiliating pay and reasonable 
work expectations—instead of being judged for being 
insufficiently dedicated. Inclusion means ensuring every- 
one feels comfortable, valued, and supported, whatever 
our circumstances. 


Ahmed Elbassiouny is a Ph.D. student at the University of Toronto. 
Send your career story to SciCareerEditor@aaas.org. 
Pandemic Preparedness in a Changing World 


Be prepared 


The founder of the global Scouting movement, Robert Baden-Powell, 
cautioned his young charges to always “be prepared.” While 
excellent advice for those trekking into the wilderness, is it applicable 
in a broader sense to a world filled with complexities and webs of 
interdependency? I would argue it is, particularly when it comes to protecting 
the health of the global population by being properly prepared for the next 
pandemic. 


In September 2021, the White House released a short but important 
document outlining the administration's goals for future pandemic 
preparedness.† It will no doubt require iterative revisions in the future to keep 
up with a changing world, but it does provide a good, immediate starting 
point for the United States—and other countries—to ready ourselves for the 
inevitable next virus outbreak. 


The key goals of this report are: (1) transform our medical defenses, (2) 
ensure situational awareness, (3) strengthen public health systems, (4) 
build core capabilities, and (5) manage the mission. It is the second goal 
that I believe provides the opportunity to have the greatest impact and 
immediate benefit for pandemic preparedness. 


This situational awareness goal includes the creation of rapid detection 
systems for identifying viruses with pandemic potential and the real-time 
monitoring of current outbreaks that could transform into pandemics. 
Consider how many lives might have been saved, and how much unnecessary 
suffering of patients and their families avoided, if an effective, transparent 
early-warning system had been in place when the SARS-CoV-2 virus began its rampage. 
We have gotten better at real-time monitoring of COVID variants, but there is 
much room for improvement. 


To state the obvious, creating, maintaining, and managing such systems 
will not be an easy task. There will need to be increased cooperation between 
all countries, true transparency, and clear accountability. Industry, academia, 
and government agencies will need to build strong ties and greater trust 
than ever before. The financial commitment will be considerable. But 
when compared with the cost to society in terms of lost lives and battered 
economies, it will be a relatively small price to pay. 

Now is the time to act, while the impacts of the COVID pandemic are still 
fresh in our minds and the political will is still strong. Let's follow the lead of 
those young Scouts and commit to earning our “Be Prepared” badge. 


Sean Sanders, Ph.D. 
Science/AAAS 


†https://www.whitehouse.gov/wp-content/uploads/2021/09/American-Pandemic-Preparedness-Transforming-Our-Capabilities-Final-For-Web.pdf. 
Fighting future viruses, together 


Fighting pandemics is a team sport. 


We must have a strong offense in play at all times to anticipate and prepare 
for what's coming. And we need a fast, effective defense to stop and neutralize 
viruses before they spread uncontrollably. 


The world is now living in an era where epidemic and pandemic threats 
have become the new normal. If there is a silver lining in the COVID-19 
pandemic, it’s what it taught us: Global collaboration is the key to stopping 
future pandemics. 

At Abbott, we are focused on helping people around the world live their 
healthiest lives, and doing so means preventing another pandemic from ever 
happening again. We are proud of our work in the fight against COVID-19 as 
the leader in testing solutions across lab, point-of-care, and rapid settings. But 
we are still playing defense against COVID-19 as it continues to evolve and as 
new viral threats emerge. 


We have to stay one step ahead, which requires strong collaboration with 
public and private partners everywhere. We have a team of virus hunters with 
decades of experience that works day in and day out looking for new threats. 


Abbott has nearly 30 years of foundational experience hunting and 
fighting viruses, and an even longer history of developing diagnostics tests. 
From inventing the first U.S. Food and Drug Administration—approved HIV 
test in 1985 to establishing our first Global Surveillance Program in 1994, the 
company’s history has led us to this moment, and we are proud to continue 
building on our previous contributions. 

Our virus hunters are now part of the Abbott Pandemic Defense Coalition, 
a first-of-its-kind global partnership launched last year with the goal of actively 
working to prevent the next pandemic. Today, coalition partners are on the 
ground in 11 countries across five continents, actively collecting, logging, and 
monitoring diverse pathogens to keep pace with new and changing viruses. 


A global pandemic such as COVID-19 is bigger than any of us—it requires 
that we come together as public health experts, governments, and private 
organizations across the world. We need to share information so we can track 
how these threats are spreading and prevent them from becoming pandemics 
in the first place. 


Now is the time to unite and act quickly and effectively as a global virus- 
fighting team. We know it’s not a matter of if another pandemic will happen, 
but when. 


Together, working side by side, we can be ready. 


John R. Hackett Jr., Ph.D. 
Divisional Vice President of Applied Research and Technology 
Abbott 
The progression of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic in Africa has so far been heterogeneous, and the full impact is not 
yet well understood. In this study, we describe the genomic epidemiology using a dataset of 8746 genomes from 33 African countries and two overseas territories. 
We show that the epidemics in most countries were initiated by importations predominantly from Europe, which diminished after the early introduction of 
international travel restrictions. As the pandemic progressed, ongoing transmission in many countries and increasing mobility led to the emergence and spread within 
the continent of many variants of concern and interest, such as B.1.351, B.1.525, A.23.1, and C.1.1. Although distorted by low sampling numbers and blind spots, 
the findings highlight that Africa must not be left behind in the global pandemic response, otherwise it could become a source for new variants. 


Severe acute respiratory syndrome coronavirus 2 
(SARS-CoV-2) emerged in late 
2019 in Wuhan, China (1, 2). Since then, 
the virus has spread to all corners of the 
world, causing almost 150 million cases 
of COVID-19 and more than 3 million deaths 
by the end of April 2021. Throughout the pandemic, 
it has been noted that Africa accounts 
for a relatively low proportion of reported cases 
and deaths—by the end of April 2021, there had 
been ~4.5 million cases and ~120,000 deaths 
on the continent, corresponding to less than 
4% of the global burden. However, emerging 
data from seroprevalence surveys and autopsy 
studies in some African countries suggest that 
the true number of infections and deaths may 
be severalfold higher than reported (3, 4). In 
addition, a recent analysis has shown that in 
many African countries, the second wave of the 
pandemic was more severe than the first wave (5). 

The first cases of COVID-19 on the African 
continent were reported in Nigeria, Egypt, and 
South Africa between mid-February and early 
March 2020, and most countries had reported 
cases by the end of March 2020 (6-8). These 
early cases were concentrated among airline 
travelers returning from regions of the world 
with high levels of community transmission. 
Many African countries introduced early public 
health and social measures, including inter- 
national travel controls, quarantine for return- 
ing travelers, and internal lockdown measures, 
to limit the spread of the virus and give health 
services time to prepare (5, 9). The initial phase 
of the epidemic was then heterogeneous, with 
relatively high case numbers reported in North 
Africa and southern Africa, and fewer cases 
reported in other regions. 

From the onset of the pandemic, genomic 
surveillance has been at the forefront of the 
COVID-19 response in Africa (10). Rapid implementation 
of SARS-CoV-2 sequencing by 
various laboratories in Africa enabled genomic 
data to be generated and shared from the early 
imported cases. In Nigeria, the first genome 
sequence was released just 3 days after the 
announcement of the first case (6). Similarly, 
in Uganda, a sequencing program was set up 
rapidly to facilitate virus tracing, and the col- 
lection of samples for sequencing began im- 
mediately upon confirmation of the first case 
(11). In South Africa, the Network for Genomic 
Surveillance in South Africa (NGS-SA) was es- 
tablished in March 2020, and within weeks, 
genomic analysis was helping to characterize 
outbreaks and community transmission (12). 

Genomic surveillance has also been criti- 
cal for monitoring ongoing SARS-CoV-2 evolu- 
tion and detection of new SARS-CoV-2 variants 
in Africa. Intensified sampling by NGS-SA in 
the Eastern Cape Province of South Africa in 
November 2020, in response to a rapid resur- 
gence of cases, led to the detection of B.1.351 
(501Y.V2) (13). This variant was subsequently 
designated a variant of concern (VOC) by the 
World Health Organization (WHO), owing to 
evidence of increased transmissibility (14) and 
resistance to neutralizing antibodies elicited 
by natural infection and vaccines (15-17). 

In this study, we performed phylogenetic 
and phylogeographic analyses of SARS-CoV-2 
genomic data from 33 African countries and 
two overseas territories to help characterize 
the dynamics of the pandemic in Africa. We 
show that the early introductions were pre- 
dominantly from Europe, but that as the pan- 
demic progressed, there was increasing spread 
between African countries. We also describe 


All author affiliations are listed at the end of this paper. 
*Corresponding author. Email: tulio@sun.ac.za 
tThese authors contributed equally to this work. 


6 


the emergence and spread of a number of key 
SARS-CoV-2 variants in Africa and highlight 
how the spread of B.1.351 (501Y.V2) and other 
variants contributed to the more severe sec- 
ond wave of the pandemic in many countries. 


SARS-CoV-2 genomic data 


By 5 May 2021, 14,504 SARS-CoV-2 genomes 
had been submitted to the GISAID database 
(8) from 38 African countries and two over- 
seas territories (Mayotte and Réunion) (Fig. 1A). 
Overall, this corresponds to approximately 
one sequence per ~300 reported cases. Almost 
half of the sequences were from South Africa 
(n = 5362), consistent with it being responsible 
for almost half of the reported cases in Africa. 
Overall, the number of sequences correlates 
closely with the number of reported cases per 
country (Fig. 1B). The countries and territories 
with the highest coverage of sequencing (de- 
fined as genomes per reported case) are Kenya 
(n = 856, one sequence per ~203 cases), Mayotte 
(n = 721, one sequence per ~21 cases), and Ni- 
geria (n = 660, one sequence per ~250 cases). 
Although genomic surveillance started early 
in many countries, few have evidence of con- 
sistent sampling across the whole year. Half 
of all African genomes were deposited in the 
first 10 weeks of 2021, suggesting intensified 
surveillance in the second wave after the detection 
of B.1.351 (501Y.V2) and other variants 
(Fig. 1, C and D). 
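
To make the coverage figures concrete, the genome counts and the "one sequence per N cases" ratios quoted above can be multiplied to recover the approximate number of reported cases each implies. The sketch below is illustrative arithmetic only; the function name and the dictionary layout are our own, and the results are rough because the ratios in the text are rounded.

```python
# Genome counts and approximate cases-per-sequence ratios quoted in the text.
coverage = {
    "Africa (all)": (14504, 300),
    "Kenya": (856, 203),
    "Mayotte": (721, 21),
    "Nigeria": (660, 250),
}


def implied_cases(genomes, cases_per_sequence):
    """Approximate reported cases implied by a genome count and coverage ratio."""
    return genomes * cases_per_sequence


implied = {name: implied_cases(g, r) for name, (g, r) in coverage.items()}

for name, cases in implied.items():
    print(f"{name}: ~{cases:,} reported cases implied")
```

The continent-wide product, about 4.35 million, is in line with the ~4.5 million cases reported for Africa by the end of April 2021.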


Genetic diversity and lineage dynamics 
in Africa 


Of the 10,326 genomes retrieved from GISAID 
by the end of March 2021, 8746 genomes passed 
quality control and met the minimum metadata 
requirements. These genomes from Africa were 
compared in a phylogenetic framework with 
11,891 representative genomes from around 
the world. Ancestral location state reconstruc- 
tion of the dated phylogeny (hereafter referred 
to as discrete phylogeographic reconstruction) 
allowed us to infer the number of viral imports 
and exports between Africa and the rest of the 
world, and between individual African coun- 
tries. African genomes in this study spanned 
the whole global genetic diversity of SARS- 
CoV-2, a pattern that largely reflects multiple 
introductions over time from the rest of the 
world (Fig. 2A). 

In total, we detected at least 757 [95% con- 
fidence interval (CI): 728 to 786] viral intro- 
ductions into African countries between the 
start of 2020 and February 2021, more than 
half of which occurred before the end of May 
2020. Although the early phase of the pan- 
demic was dominated by importations from 
outside Africa, predominantly from Europe, 
there was then a shift in the dynamics, with an 
increasing number of importations from other 
African countries as the pandemic progressed 
(Fig. 2, B and C). A rarefaction analysis in 


which we systematically subsampled genomes 
shows that vastly more introductions would 
have likely been identified with increased sam- 
pling in Africa or globally, suggesting that the 
introductions we identified are really just the 
“ears of the hippo,” or a small part of a larger 
problem (fig. S1).
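
The logic of a rarefaction analysis can be illustrated with a minimal sketch. It assumes, for simplicity, that every genome has already been assigned to one introduction event, whereas the study itself inferred introductions from phylogeographic reconstruction. The function name, the labels, and the toy data are hypothetical and are not taken from the study.

```python
import random


def rarefaction_curve(intro_labels, fractions, replicates=200, seed=1):
    """Mean number of distinct introductions seen at each subsampling fraction.

    intro_labels: one (hypothetical) introduction-event ID per genome.
    """
    rng = random.Random(seed)
    curve = []
    for frac in fractions:
        k = max(1, round(frac * len(intro_labels)))
        # Draw k genomes without replacement and count the introductions they reveal.
        counts = [len(set(rng.sample(intro_labels, k))) for _ in range(replicates)]
        curve.append((frac, sum(counts) / replicates))
    return curve


# Toy data (not from the study): 50 introductions with uneven cluster sizes.
toy_labels = [i for i in range(50) for _ in range(1 + 4 * (i % 5))]
curve = rarefaction_curve(toy_labels, [0.1, 0.25, 0.5, 0.75, 1.0])
for frac, mean_intros in curve:
    print(f"{frac:.0%} of genomes -> {mean_intros:.1f} introductions detected")
```

If the curve is still climbing at the full sample, as it is in this toy case until every genome is included, more sampling would be expected to reveal more introductions.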

South Africa, Kenya, and Nigeria appear 
as major sources of importations into other 
African countries (Fig. 2D), although this is 
likely to be influenced by these three countries 
having the greatest number of deposited se- 
quences. Particularly notable is the southern 
African region, where South Africa is the source 
for a large proportion (~80%) of the impor- 
tations to other countries in the region. The 
North African region demonstrates a differ- 
ent pattern to the rest of the continent, with 
more viral introductions from Europe and Asia 
(particularly the Middle East) than from other 
African countries (fig. S2).

Africa has also contributed to the interna- 
tional spread of the virus, with at least 324 
(95% CI: 295 to 353) exportation events from 
Africa to the rest of the world detected in this 
dataset. Consistent with the source of impor- 
tations, most exports were to Europe (41%), 
Asia (26%), and North America (14%). As with 
the number of importations, exports were rela- 
tively evenly distributed over the 1-year period 
(fig. S3). However, an increase in the number 
of exportation events occurred between De- 
cember 2020 and March 2021, which coin- 
cided with the second wave of infections in 
Africa and with some relaxations of travel re- 
strictions around the world. 

The early phase of the pandemic was char- 
acterized by the predominance of lineage B.1. 
This was introduced multiple times to African 
countries and has been detected in all but one 
of the countries included in this analysis. After 
its emergence in South Africa, B.1.351 became the 
most frequently detected SARS-CoV-2 lineage 
found in Africa (n = 1769, ~20%) (Fig. 1C). It 
was first sampled on 8 October 2020 in South 
Africa (13) and has since spread to 20 other 
African countries. 

As air travel came to an almost complete halt 
in March and April 2020, the number of detectable
viral imports into Africa decreased
and the pandemic entered a phase that was 
characterized in sub-Saharan Africa by sus- 
tained low levels of within-country movements 
and occasional international viral movements 
between neighboring countries, presumably 
via road and rail links between these. Though 
some border posts between countries were 
closed during the initial lockdown period (table
S1), others remained open to allow trade to
continue. Regional trade in southern Africa 
was only slightly affected by lockdown restric- 
tions and quickly rebounded to prepandemic 
levels (fig. S4) after the relaxation of restric- 
tions between June 2020 and December 2020. 

Although lineage A viruses were imported
into several African countries, they only ac- 
count for 1.3% of genomes sampled in Africa. 
Despite lineage A viruses initially causing many 
localized clustered outbreaks, each the result 
of independent introductions to several countries
(e.g., Burkina Faso, Côte d’Ivoire, and
Nigeria), they were later largely replaced by 
lineage B viruses as the pandemic evolved. 
This is possibly due to the increased transmis- 
sibility of lineage B viruses by virtue of the 
D614G (Asp614→Gly) mutation in the spike
protein (19, 20). However, there is evidence of 
an increasing prevalence of lineage A viruses in 
some African countries (11). In particular, A.23.1
emerged in East Africa and appears to be rapidly 
increasing in prevalence in Uganda and Rwanda 
(11). Furthermore, a highly divergent variant 
from lineage A was recently identified in Angola 
from individuals arriving from Tanzania (21).


Emergence and spread of new 
SARS-CoV-2 variants 


To determine how some of the key SARS-CoV-2 
variants are spreading within Africa, we per- 
formed phylogeographic analyses on the VOC 
B.1.351, the variant of interest (VOI) B.1.525,
and two additional variants that emerged and 
that we designated as VOIs for this analysis 
(A.23.1 and C.1.1). These African VOCs and VOIs 
have multiple mutations on the spike glyco- 
protein, and a molecular clock analysis of these 
four datasets provided strong evidence that 
these four lineages are evolving in a clock-like 
manner (Fig. 3, A and B). 
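
As an illustration of what a clock-like signal means in practice, the sketch below regresses root-to-tip genetic distance on sampling date. The slope estimates the evolutionary rate, the x-intercept estimates the date of the common ancestor, and r and r² summarize the strength of the signal. This is a generic least-squares version under stated assumptions, not the authors' pipeline; the function name and the toy values are hypothetical.

```python
def root_to_tip_regression(dates, distances):
    """Least-squares fit of root-to-tip distance against sampling date.

    dates: sampling dates as decimal years.
    distances: root-to-tip distances in substitutions per site.
    """
    n = len(dates)
    mean_x = sum(dates) / n
    mean_y = sum(distances) / n
    sxx = sum((x - mean_x) ** 2 for x in dates)
    syy = sum((y - mean_y) ** 2 for y in distances)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dates, distances))
    slope = sxy / sxx                      # substitutions per site per year
    intercept = mean_y - slope * mean_x
    r = sxy / (sxx * syy) ** 0.5           # coefficient of correlation
    return {
        "rate": slope,
        "tmrca": -intercept / slope,       # date at which distance is zero
        "r": r,
        "r2": r * r,                       # coefficient of determination
    }


# Toy, perfectly clock-like data (hypothetical): rate 8e-4, ancestor at 2020.5.
toy_dates = [2020.6, 2020.7, 2020.8, 2020.9, 2021.0]
toy_distances = [8e-4 * (t - 2020.5) for t in toy_dates]
fit = root_to_tip_regression(toy_dates, toy_distances)
print(fit)
```

Real data scatter around the line, so r² falls below 1; values close to 1 indicate that divergence accumulates steadily with time.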

B.1.351 was first sampled in South Africa in 
October 2020, but phylogeographic analysis 
suggests that it emerged earlier, around August 
2020. It is defined by 10 mutations in the spike 
protein, including K417N (Lys417→Asn), E484K
(Glu484→Lys), and N501Y (Asn501→Tyr) in the
receptor binding domain (Fig. 3B). After its 
emergence in the Eastern Cape, it spread ex- 
tensively within South Africa (Fig. 4A). By 
November 2020, the variant had spread into 
neighboring Botswana and Mozambique, and 
by December 2020, it had reached Zambia and 
Mayotte. Within the first 3 months of 2021,
further exports from South Africa into Botswana,
Zimbabwe, Mozambique, and Zambia occurred.
By March 2021, B.1.351 had become the dominant
lineage within most southern African
countries as well as the overseas territories of
Mayotte and Réunion (fig. S5). Our phylogeographic
reconstruction also demonstrates
movement of B.1.351 into East and Central
Africa directly from southern Africa. Our discrete
phylogeographic analysis of a wider sample
of B.1.351 isolates demonstrates the spread
of the lineage into West Africa. This patient
from West Africa had a known travel history
to Europe, so it is possible that the patient acquired
the infection while in Europe or in transit
and not from other African sources (fig. S6).

Fig. 1. SARS-CoV-2 sequences in Africa. (A) Map of
the African continent with the number of SARS-CoV-2
sequences reflected in GISAID as of 5 May 2021.
(B) Regression plot of the number of viral sequences
versus the number of reported COVID-19 cases in
various African countries as of 5 May 2021. Countries
with >500 sequences are labeled. The shaded region
indicates the 95% confidence interval. (C) Progressive
distribution of the top 20 PANGO lineages on the
African continent. (D) Temporal sampling of
SARS-CoV-2 sequences in African countries
(ordered by total number of sequences) through
time, with VOCs of note highlighted and annotated
according to their PANGO lineage assignment.


B.1.525 is a VOI defined by six substitutions
in the spike protein [Q52R (Gln52→Arg),
A67V (Ala67→Val), E484K, D614G, Q677H
(Gln677→His), and F888L (Phe888→Leu)] and
two deletions in the N-terminal domain [HV69-70Δ
(deletion of His and Val at positions 69
and 70) and Y144Δ (deletion of Tyr at position
144)]. This was first sampled in the
United Kingdom in mid-December 2020, but 
our phylogeographic reconstruction suggests 
that the variant originated in Nigeria in 
November 2020 [95% highest posterior den- 
sity (HPD) 2020-11-01 to 2020-12-03] (Fig. 4B). 
Since then, it has spread throughout much of
Nigeria and neighboring Ghana. Given sparse
sampling from other neighboring countries
within West and Central Africa (Fig. 1, A and C),
the extent of the spread of this VOI in the region
is not clear. Beyond Africa, this VOI has
spread to Europe and the United States (fig. S6).

We designated A.23.1 and C.1.1 as VOIs for the
purposes of this analysis because they present
good examples of the continued evolution of
the virus within Africa (11, 13). Lineage A.23,
characterized by three spike mutations [F157L
(Phe157→Leu), V367F (Val367→Phe), and Q613H
(Gln613→His)], was first detected in a Ugandan
prison in Amuru in July 2020 (95% HPD:
2020-07-15 to 2020-08-02). From there, the lineage
was transmitted to Kitgum prison, possibly
facilitated by the transfer of prisoners. Subsequently,
the A.23 lineage spilled into the
general population and spread to Kampala,
adding other spike mutations [R102I (Arg102→Ile),
L141F (Leu141→Phe), E484K, and P681R
(Pro681→Arg)] along with additional mutations
in nsp3, nsp6, ORF8, and ORF9, prompting a
new lineage classification, A.23.1 (Fig. 3, A and
B). Since the emergence of A.23.1 in September
2020 (95% HPD: 2020-09-02 to 2020-09-28), it
has spread regionally into neighboring Rwanda
and Kenya and has now also reached South
Africa and Botswana in the south and Ghana in
the west (Fig. 4C). However, our phylogeographic
reconstruction of A.23.1 suggests that
the introduction into Ghana may have occurred
via Europe (fig. S6), whereas the introductions
into southern Africa likely occurred directly
from East Africa. This is consistent with epidemiological
data suggesting that the case detected
in South Africa was a contact of an individual
who had recently traveled to Kenya.

Fig. 2. Phylogenetic reconstruction of the SARS-CoV-2 pandemic on the continent
of Africa. (A) Time-resolved maximum likelihood tree containing 8746 high-quality
African SARS-CoV-2 near-full-genome sequences analyzed against a
backdrop of global reference sequences. VOIs and VOCs are highlighted on the
phylogeny. (B) Sources of viral introductions into African countries characterized as
external introductions from the rest of the world versus internal introductions from
other African countries. (C) Total external viral introductions over time into Africa.
(D) The number of viral imports and exports into and out of various African
countries depicted as internal (between African countries, in pink) or external
(between African and non-African countries, in blue and gray).

Fig. 3. Genetic profile of VOCs and VOIs under investigation. (A) Root-to-tip regression plots for four lineages of interest. C.1 and A.23 show continued
evolution into VOIs C.1.1 and A.23.1, respectively. r, coefficient of correlation; r², coefficient of determination. (B) Genome maps of four VOCs and VOIs, where the spike
region is shown in detail and in color and the rest of the genome is shown in gray. ORF, open reading frame; NTD, N-terminal domain; RBD, receptor binding domain;
RBM, receptor binding motif; SD1, subdomain 1; SD2, subdomain 2.

Lineage C.1 emerged in South Africa in March 
2020 (95% HPD: 2020-03-13 to 2020-04-17) 
during a cluster outbreak before the first 
wave of the epidemic (13). C.1.1 is defined by
the spike mutations S477N (Ser477→Asn), A688S
(Ala688→Ser), and M1237I (Met1237→Ile) and
also contains the Q52R and A67V mutations 
similar to B.1.525 (Fig. 3B). A continuous trait 
phylogeographic reconstruction of the move- 
ment dynamics of these lineages suggests that 
C.1 emerged in the city of Johannesburg and
spread within South Africa during the first wave
(Fig. 4D). Independent exports of C.1 from 
South Africa led to regional spread to Zambia 
(June to July 2020) and Mozambique (July to 
August 2020), and the evolution to C.1.1 seems 
to have occurred in Mozambique around mid- 
September 2020 (95% HPD: 2020-09-07 to 
2020-10-05). An in-depth analysis of SARS- 
CoV-2 genotypes from Mozambique suggests 
that the C.1.1 lineage was the most prevalent in 
the country until the introduction of B.1.351, 
which has dominated the epidemic since (fig. S5). 

The VOC B.1.1.7, which was first sampled in 
Kent, England, in September 2020 (22), has 
also increased in prevalence in several African 
countries (fig. S5). To date, this VOC has been
detected in 11 African countries, as well as the
Indian Ocean islands of Mauritius and Mayotte 
(fig. S7). The time-resolved phylogeny suggests 
that this lineage was introduced into Africa on 
at least 16 occasions between November 2020 
and February 2021, with evidence of local trans- 
mission in Nigeria and Ghana. 


Conclusions 


Our phylogeographic reconstruction of past 
viral dissemination patterns suggests a strong 
epidemiological linkage between Europe and 
Africa, with 64% of detectable viral imports into 
Africa originating in Europe and 41% of detect- 
able viral exports from Africa landing in Europe 
(Fig. 1C). This phylogeographic analysis also 
suggests a changing pattern of viral diffusion
into and within Africa over the course of 2020. 
In almost all instances, the earliest introduc- 
tions of SARS-CoV-2 into individual African 
countries were from countries outside Africa. 

High rates of COVID-19 testing and con- 
sistent genomic surveillance in the south of 
the continent have led to the early identifi- 
cation of VOCs such as B.1.351 and VOIs such 
as C.1.1 (13). Since the discovery of these southern
African variants, several other SARS-CoV-2
VOIs have emerged in different parts of the 
world, including elsewhere on the African 
continent, such as B.1.525 in West Africa and 
A.23.1 in East Africa. There is strong evidence 
that both of these VOIs are rising in frequency 
in the regions where they have been detected, 
which suggests that they may possess higher 
fitness than other variants in these regions. 
Although more-focused research on the bio- 
logical properties of these VOIs is needed to 
confirm whether they should be considered 
VOCs, it would be prudent to assume the worst 
and focus on limiting their spread. It will be 
important to investigate how these different 
variants compete against one another if they 
occupy the same region. 

Our focused phylogenetic analysis of the 
B.1.351 lineage revealed that in the final months 
of 2020, this variant spread from South Africa 
into neighboring countries, reaching as far 
north as the Democratic Republic of the Congo 
(DRC) by February 2021. This spread may have 
been facilitated through rail and road net- 
works that form major transport arteries link- 
ing South Africa’s ocean ports to commercial 
and industrial centres in Botswana, Zimbabwe, 
Zambia, and the southern parts of the DRC. The 
rapid, apparently unimpeded spread of B.1.351 
into these countries suggests that current land- 
border controls that are intended to curb the 
international spread of the virus are ineffec- 
tive. Perhaps targeted testing of cross-border 
travelers, genotyping of positive cases, and the 
focused tracking of frequent cross-border trav- 
elers, such as long distance truckers, would more 
effectively contain the spread of future VOCs 
and VOIs that emerge within this region. 

The dominance of VOIs and VOCs in Africa 
has important implications for vaccine roll- 
outs on the continent. For one, slow rollout of 
vaccines in most African countries creates an 
environment in which the virus can replicate 
and evolve: This will almost certainly produce 
additional VOCs, any of which could derail 
the global fight against COVID-19. Conversely, 
with the already widespread presence of known 
variants, difficult decisions about balancing 
reduced efficacy and availability of vaccines 
have to be made. This also highlights how 
crucial it is that trials are done. From a public 
health perspective, genomic surveillance is only 
one item in the toolkit of pandemic prepared- 
ness. It is important that such work is closely 
followed by genotype-to-phenotype research to
determine the actual relevance of continued 
evolution of SARS-CoV-2 and other emerging 
pathogens. 

The rollout of vaccines across Africa has 
been painfully slow (figs. S8 and S9). There 
have, however, been notable successes that 
suggest that the situation is not hopeless. The 
small island nation of the Seychelles had vac- 
cinated 70% of its population by May 2021. 
Morocco has kept pace with many developed 
nations and, by mid-March, had vaccinated 
~16% of its population. Rwanda, one of Africa’s 
most resource-constrained countries, had, within 
3 weeks of obtaining its first vaccine doses in 
early March, managed to provide first doses to 
~2.5% of its population. For all other African 
countries, at the time of writing, vaccine coverage 
(first dose) was <1.0% of the general population. 

The effectiveness of molecular surveillance 
as a tool for monitoring pandemics is largely 
dependent on continuous and consistent sam- 
pling through time, rapid virus genome se- 
quencing, and rapid reporting. When this is 
achieved, molecular surveillance can ensure 
the early detection of changing pandemic char- 
acteristics. Further, when such changes are dis- 
covered, molecular surveillance data can also 
guide public health responses. In this regard, 
the molecular surveillance data that are being 
gathered by most African countries are less 
useful than they could be. For example, the 
time lag between when virus samples are taken 
and when sequences for these samples are 
deposited in sequence repositories is so great 
in some cases that the primary utility of ge- 
nomic surveillance data is lost (fig. S10). This 
lag is driven by several factors, depending on 
the laboratory or country in question: (i) lack 
of reagents owing to disruptions in global sup- 
ply chains, (ii) lack of equipment and infrastruc- 
ture within the originating country, (iii) scarcity 
of technical skills in laboratory methods or bio- 
informatic support, and (iv) hesitancy by some 
health officials to release data. More-recent 
sampling and prompt reporting is crucial to 
reveal the genetic characteristics of currently 
circulating viruses in these countries. 

The patchiness of African genomic surveil- 
lance data is therefore the main weakness of 
our study. However, there is evidence that the 
situation is improving, with ~50% of African 
SARS-CoV-2 genome sequences having been 
submitted to the GISAID database within the 
first 10 weeks of 2021. Although the precise 
factors underlying this surge in sequencing 
efforts are unclear, an important driver is al- 
most certainly increased global interest in 
genomic surveillance after the discovery of 
multiple VOCs and VOIs since December 2020. 
We cannot rule out that the observed increase
in exports from Africa may be due to inten- 
sified sequencing activity after the detection 
of variants around the world. It is important 
to note here that phylogeographic reconstruction
of viral spread is highly dependent on
sampling, with the caveat that the
exact routes of viral movements between countries
cannot be inferred if there is no sampling
in connecting countries. Furthermore,
our efforts to reconstruct the movement dy- 
namics of SARS-CoV-2 across the continent are 
almost certainly biased by uneven sampling 
between different African countries. It is not 
a coincidence that we identified South Africa, 
Kenya, and Nigeria, which have sampled and 
sequenced the most SARS-CoV-2 genomes, 
as major sources of viral transmissions between 
sub-Saharan African countries. However, these 
countries also had the highest number of infec- 
tions, which may decrease the sampling biases 
(Fig. 1A). 

The reliability of genomic surveillance as a 
tool to prevent the emergence and spread of 
dangerous variants is dependent on the in- 
tensity with which it is embraced by national 
public health programs. As with most other 
parts of the world, the success of genomic sur- 
veillance in Africa requires that more samples 
are tested for COVID-19, higher proportions of 
positive samples are sequenced within days 
of sampling, and persistent analyses of these 
sequences are performed for concerning sig- 
nals such as (i) the presence of novel nonsynon- 
ymous mutations at genomic sites associated 
with pathogenicity and immunogenicity, (ii) 
evidence of positive selection at codon sites 
where nonsynonymous mutations are observed, 
and (iii) evidence of lineage expansions. Des- 
pite limited sampling, Africa has identified 
many of the VOCs and VOIs that are being 
transmitted across the world. Detailed char- 
acterization of the variants and their impact 
on vaccine-induced immunity is of extreme 
importance. If the pandemic is not controlled 
in Africa, we may see the production of vaccine 
escape variants that may profoundly affect 
the population in Africa and across the world. 

Pandemic Preparedness in a Changing World


Monkeypox is a new global threat. African scientists know what the world is up against

Cases in West and Central Africa have been on the rise for decades


BY JON COHEN 


As monkeypox stokes here-we-go-again
fears in a pandemic-weary world, some
researchers in Africa are having their
own sense of déjà vu. Another neglected
tropical disease of the poor gets atten- 
tion only after it starts to infect people 
in wealthy countries. “It’s as if your neighbor's 
house is burning and you just close your window 
and say it’s fine,” says Yap Boum, an epidemiol- 
ogist in Cameroon who works with both the 
health ministry and Doctors Without Borders. 

Now, the fire is spreading. The global outbreak 
of monkeypox, which causes smallpoxlike skin 
lesions but is not usually fatal, surfaced on 7 May 
in the United Kingdom. More than 700 suspected 
and confirmed cases had been reported as of 31 
May, from every continent other than Antarcti- 
ca. It is the largest ever outbreak outside of Africa 
and is concentrated among men who have sex 
with men, a phenomenon never seen before. Pub- 
lic health officials and scientists are scrambling to 
understand how the virus spreads and how to 
stop it—and they are paying new attention to Af- 
rica’s long experience with the disease. 

“We are interdependent,” Boum notes. “What
is happening in Africa will definitely impact 
what is happening in the West and vice versa.” 

Monkeypox is endemic in 10 countries in 
West and Central Africa, with dozens of cases 
this year in Cameroon, Nigeria, and the Central 
African Republic (CAR). The Democratic Re- 
public of the Congo (DRC) has by far the high- 
est burden, with 1284 cases in 2022 alone. Those 
numbers are almost certainly underestimates.
In the DRC, infections most often happen in re- 
mote rural areas; in the CAR, armed conflict in 
several regions has limited surveillance. 

The virus got its name after it was first iden- 
tified in a colony of Asian monkeys in a Co- 
penhagen, Denmark, laboratory in 1958, but it 
has only been isolated from a wild monkey—in 
Africa—once. It appears to be more common 
in squirrel, rat, and shrew species, occasionally 
spilling over into the human population, where 
it spreads mainly through close contact, but not 
through breathing. Isolating infected people 
typically helps outbreaks end quickly. 

Cases have steadily increased in sub-Saharan 
Africa over the past 3 decades, driven largely by 
a medical triumph. The vaccine against small- 
pox, a far deadlier and more transmissible virus, 
also protects against monkeypox, but the world 
stopped using it in the 1970s, shortly before 
smallpox was declared eradicated. As a result, 
“There's a huge, huge number of people who are 
now susceptible to monkeypox,” says Placide
Mbala, a virologist who heads the genomics lab 
at the National Institute of Biomedical Research 
(INRB) in Kinshasa, DRC. 

Mbala says demographic shifts have fueled 
the rise as well. “People are more and more mov- 
ing to the forest to find food and to build hous- 
es, and this increases the contact between the 
wildlife and the population,” he says. Studies in 
the CAR showed cases spike after villagers move 
into the forest during the rainy season to collect 
caterpillars that are sold for food. “When they 
stay in the bush they get in contact easily with
the animal reservoir,” says virologist Emmanuel
Nakouné, scientific director at the Pasteur Insti- 
tute of Bangui, which in 2018 launched a pro- 
gram named Afripox with French investigators 
to better understand and fight monkeypox. 

Outbreaks outside Africa, including the cur- 
rent one, have all involved the West African 
strain, which kills about 1% of those it infects. 
The Congo Basin strain, found in the DRC and 
the CAR, is 10 times more lethal, yet despite the 
relatively high disease burden in the DRC, it has 
never left Africa. But it has never caused a seri- 
ous outbreak in a Congolese city either, which 
underscores the isolation of the areas where it is 
endemic. “It’s kind of a self-quarantine,” Mbala 
says. “Those people don't move from DRC to 
other countries.” 

Just where the current outbreak started, and 
how long ago, is unclear. “It’s a little bit like we’ve 
tuned into a new TV series and we don’t know
which episode we’ve landed on,” says Anne Rimoin,
an epidemiologist at the University of
California, Los Angeles, who has worked on 
monkeypox in the DRC for 20 years. The first 
patient with an identified case traveled from Ni- 
geria to the United Kingdom on 4 May, but does 
not appear to have infected anyone else. Two pa- 
tients diagnosed later, one in the United States 
and the other in the United Arab Emirates, had 
recently traveled to Africa as well, and perhaps 
imported the virus separately. But none of the 
other cases identified in recent weeks has links 
to infected travelers or animals from endemic 
countries. Instead, many early cases were linked
to transmission at gay festivals and saunas in
Spain, Belgium, and Canada.

Spilling over
The monkeypox virus infects squirrel, rat, and shrew species in at least 10 countries in West and
Central Africa and occasionally jumps into the human population. So far this year, five countries
have reported human cases.

[Map of Africa: countries endemic for monkeypox; countries with monkeypox cases or deaths reported in 2022]
Nigeria: 46 cases, 0 deaths
Cameroon: 24 cases, 9 deaths
Republic of Congo: 2 cases, 0 deaths
Central African Republic: 8 cases, 2 deaths
Democratic Republic of the Congo: 1284 cases, 58 deaths

(GRAPHIC) K. FRANKLIN/SCIENCE; (DATA) WORLD HEALTH ORGANIZATION

Some suspect the virus may have been im- 
ported from Nigeria, Africa's most populous 
country, which has good infrastructure connect- 
ing rural areas to large cities and two airports 
that are among the busiest in Africa. But this is 
“highly speculative,” stresses Christian Happi,
who runs Nigeria’s African Centre of Excellence 
for Genomics of Infectious Diseases. Happi urges
people in other countries “not to point fingers,”
but to collaborate.

Epidemiologist Ifedayo Adetifa, head of the 
Nigeria Centre for Disease Control, says the 
country receives undue attention because it does 
more surveillance than its neighbors and shares 
what it finds. “There's too much emphasis for 
whatever reasons in Western capitals and news 
media about trying to hold somebody responsi- 
ble for a particular outbreak,” he says. “We don't 
think those narratives are helpful.” Adetifa says 
that although Nigeria has recently seen “an uptick
in cases,” he is confident it’s not missing a
large number of them. “We are literally rattling
the bushes to see what comes out.”

African countries’ ability to deal with mon- 
keypox was improving even before the current 
outbreak. The DRC has stepped up its surveil- 
lance across the vast country, which is key to 
isolating infected people and tracking the virus’s
moves. INRB and a lab in Goma can now diagnose
samples using the polymerase chain reaction
assay, and researchers ultimately hope to
develop rapid tests for use in clinics nationwide. 
INRB and labs in Nigeria can also sequence the 
full genome of the virus, and Nigeria plans to 
make public genomes of several recent monkey- 
pox isolates, Adetifa says. Those and other se- 
quences from Africa could help researchers pin- 
point the source of the international outbreak by 
building viral family trees. 

For now, Africa lacks medicines to prevent 
and treat monkeypox. In the United Kingdom 
and the United States, high-risk contacts of cases 
are being offered a vaccine produced by Bavari- 
an Nordic that was approved for monkeypox by 
the U.S. Food and Drug Administration in 2019, 
but it’s not available anywhere in Africa. The 
U.S. Centers for Disease Control and Prevention
and collaborators in the DRC are testing the vac- 
cine in health care workers; the 2019 approval 
was based on animal studies. 

In the CAR, 14 people with monkeypox have 
received an experimental drug, tecovirimat, as 
part of a trial launched by the University of Ox- 
ford in July 2021. “We've had very good results,” 
says Nakouné, who says he expects the data to be 
published within the next few weeks. The drug’s 
manufacturer, SIGA, has pledged to provide up 
to 500 treatment courses to the country. 

Although the international outbreak has— 
again—highlighted global health inequities, it 
has also brought much-needed attention to the 
smoldering disease in Africa. “It’s been really 
hard to get the resources to do the kind of back- 
ground work that really needs to be done and 
that isn’t hair-on-fire, in the context of an emergency,”
Rimoin says. “We cannot keep hitting the
snooze button. Now, the stakes are really high.”

The puzzle of the COVID-19 pandemic in Africa
More data are needed to understand the determinants of the COVID-19 pandemic across Africa


By Justin M. Maeda and John N. Nkengasong 


The COVID-19 pandemic has been
puzzling to many public health experts
because Africa has reported far fewer
cases and deaths from COVID-19
than predicted. As of 22 November
2020, the continent of Africa, comprising
1.3 billion people, had recorded 2,070,953
cases of COVID-19 and 49,728 deaths (1), 
representing ~3.6% of total global cases (2, 3). 
Because of the continent's overstrained and 
weak health systems, inadequate financing of 
health care, paucity in human resources, and 
challenges posed by existing endemic dis- 
eases—including HIV, tuberculosis, and ma- 
laria—earlier predictions suggested that up 
to 70 million Africans may be infected with 
severe acute respiratory syndrome coronavi- 
rus-2 (SARS-CoV-2) by June, with more than 
3 million deaths (4). On page 79 of this issue, 
Uyoga et al. (5) report a serosurvey study 
(measuring the occurrence of SARS-CoV-2 
antibodies) of blood donors in Kenya that 
suggested that the incidence of SARS-CoV-2 
infection is much higher than expected from 
case numbers. 

Using blood donor samples as a proxy, 
Uyoga et al. estimated that SARS-CoV-2 in- 
fections occurred in 5.5% of the population 
in Kisumu, 7.3% in Nairobi, and 8.0% in 
Mombasa, with an overall average of 4.3%. 
This translates to ~2.2 million total possible 
infections compared with the reported 77,585 
infections in the country as of 23 November 
2020 (1, 3). Similarly, in October 2020, Mozambique
reported fewer than 3000 confirmed
cases of COVID-19; however, serosurveys 
found that 5% of households in the city of 
Nampula and 2.5% of households in the city 
of Pemba had been exposed to the virus (6). 
This suggests that there may be more infec- 
tions than recorded. 
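
The size of the gap for Kenya can be made explicit with a little arithmetic. The sketch below uses only the figures quoted above. The implied population is back-calculated from those figures and is not a number given in the text, and the variable names are illustrative.

```python
# Figures quoted in the text for Kenya.
seroprevalence = 0.043            # overall average from the blood donor serosurvey
estimated_infections = 2_200_000  # ~2.2 million total possible infections
reported_infections = 77_585      # reported as of 23 November 2020

# Estimated infections for every reported infection.
underascertainment_ratio = estimated_infections / reported_infections

# Share of estimated infections that appear in the official count.
reported_share = reported_infections / estimated_infections

# Population implied by the two serosurvey figures (back-calculated, an assumption).
implied_population = estimated_infections / seroprevalence

print(f"Estimated infections per reported infection: {underascertainment_ratio:.1f}")
print(f"Share of estimated infections reported: {reported_share:.1%}")
print(f"Implied population: {implied_population / 1e6:.1f} million")
```

On these numbers, roughly 28 infections would have occurred for every one that was reported.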

There are several factors that may influence 
the trajectory of the COVID-19 pandemic in 
Africa. These include limited testing (which 
limits detection and isolation, and thus public 
health measures), a much younger population 
(and thus fewer severe cases and deaths), climatic
differences (which could affect transmission),
preexisting immunity, genetic factors,
early implementation of public health
measures, and timely leadership. Two key as- 
pects that may contribute to our understand- 
ing of the pandemic puzzle in Africa include 
scaling up of testing and use of serosurveys.

One way to unravel the puzzle of SARS-CoV-2
spread is to understand how testing
and reporting of cases has occurred. On 14 
February 2020, the first cases of COVID-19 
were reported in Africa, and by 17 April 2020, 
the continent had conducted an estimated 
330,419 SARS-CoV-2 tests; that is 0.03% of 
the entire continent's population. In an ef- 
fort to scale up testing, the Africa Centres 
for Disease Control and Prevention (Africa 
CDC) launched the Partnership to Accelerate 
COVID-19 Testing (PACT) in April 2020. Be- 
cause of the PACT initiative, testing was scaled 
up rapidly from ~600,000 per month in April 
to ~3.5 million per month in November 2020, 
an increase of ~5.5-fold (7), with 39 of 55 
(71%) countries reporting more than 10 tests 
conducted for every case identified, as recom- 
mended by the World Health Organization 
(WHO) (8). Testing capacity has varied over 
time, with positivity rate fluctuating between 
5 and 15% regardless of the increased testing 
boost brought by the introduction of PACT. 
Therefore, it is clear that testing has been 
challenging (9, 10), which limits our understanding
of the full extent of the spread of
SARS-CoV-2 infection in Africa. As such, se- 
rosurveys are critical because they can provide 
data on SARS-CoV-2 infection trends, effects 
of interventions, demographic characteriza- 
tion, and vaccine effects. Such surveys can 
also inform on planning for vaccine deploy- 
ment by providing data to guide prioritization 
between different populations. They can also 
aid understanding of the drivers of infection 
through linking current or previous infection 
with epidemiological and demographic data. 
Currently, the continent is facing a challeng- 
ing phase of the pandemic with an observed 
second wave of cases (see the figure). More 
people need to be tested in different localities, 
including repeated testing over time, so that 
the patterns and risk factors of viral spread 
can be understood. 
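
The testing figures in this paragraph can be checked with a few lines of arithmetic. The Python sketch below uses only the rounded numbers quoted above; the helper `meets_who_benchmark` is a hypothetical name introduced here to express the "more than 10 tests per case" benchmark, not an official WHO formula.

```python
# Figures quoted in the text (rounded).
population = 1_300_000_000       # continent population (1.3 billion)
tests_by_mid_april = 330_419     # tests conducted by 17 April 2020
tests_per_month_april = 600_000  # ~600,000 tests per month in April
tests_per_month_nov = 3_500_000  # ~3.5 million tests per month in November

# Tests conducted by mid-April as a share of the population.
tests_per_capita = tests_by_mid_april / population

# Fold change in monthly testing between April and November.
fold_increase = tests_per_month_nov / tests_per_month_april


def meets_who_benchmark(tests: int, cases: int) -> bool:
    """Return True if more than 10 tests were run per case identified."""
    return tests > 10 * cases


print(f"Tests per capita by mid-April: {tests_per_capita:.2%}")
print(f"Fold increase, April to November: {fold_increase:.1f}")
```

With these rounded inputs the per-capita figure rounds to the 0.03% quoted above, while the fold increase comes out slightly above the ~5.5-fold reported, which presumably reflects unrounded monthly counts.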

Several serosurveys have been conducted 
in Africa. The studies differ in methodolog- 
ical approach used: simple random sampling, 
use of existing sentinel sites, and targeted 
population (specific subnational unit, preg- 
nant women, blood donors, and people living 
with HIV). The types of laboratory tests used 
(rapid tests and enzyme-linked immunosor- 
bent assays) also differed between studies 
so as to unveil the drivers of infection and 
disease spread. Given the limited ability to
conduct field surveys (the preferred method)
owing to travel restrictions, Uyoga et al. investigated
blood donors to reveal the pandemic
puzzle in Kenya. From these surveys in Africa,
seroprevalence of SARS-CoV-2-specific
antibodies has ranged from 2.2 to 39% of the
population in different settings and countries.
However, none of the studies have used a
nationally representative sample.

COVID-19 cases and deaths in Africa
The trend of daily reported cases of COVID-19 for the African continent, February to November 2020,
shows the first peak of cases occurred July to August (mostly attributed to the Southern African Region)
followed by a second peak, which started in October (mostly attributed to the Northern Region).

To ensure a harmonized and standardized 
method of conducting serosurveys in Africa, 
the Africa CDC is supporting multinational,
population-based, age- and gender-stratified
serosurveys that use a standardized protocol
and data collection tools (11). The protocol
is built on a simple model, using a point-of-care
rapid test for antibody detection of current
and previous infection, to ensure feasibility
and simplicity while maintaining study
quality and credibility of the evidence generated.
A similar approach has been applied to
nationally representative cohorts in Brazil and
Spain (12, 13).

Across Africa, policy makers are faced with 
the dilemma of striking a balance between 
limiting transmission and protecting econ- 
omies, businesses, and livelihoods. This has 
created a demand for quality and comprehen- 
sive data. Serosurveys could therefore com- 
plement existing response strategies. Such 
surveys should adhere to the following principles:
a nationally representative sample through
well-designed sampling strategies that ensure 
inclusivity of all possible strata within the 
country; simplicity to guarantee feasibility 
and quick delivery; optimization of resources 
for implementation (human, material, and fi- 
nancial) to safeguard the already constrained 
resources for response; complementarity to 
already existing surveillance and response 
data; and the ability to longitudinally track 
the same aspect of data and information over 
time to inform adaptive strategies. 

Timely leadership and coordination may 
be a second aspect that explains the pandem- 
ic pattern in Africa. The continent reacted in 
a timely and collective manner once the first 
cases of SARS-CoV-2 were reported in Egypt 
on 14 February 2020. Following that, on 22 
February 2020, the Africa CDC convened an 
emergency meeting of all ministers of health 
at the headquarters of the African Union
Commission in Addis Ababa, Ethiopia. The 
ministers adopted a joint continental strategy 
that had three goals: limit transmission, limit 
deaths, and limit social and economic harms 
and impacts on other endemic diseases, un- 
derpinned by the need to coordinate, coop- 
erate, collaborate, and communicate efforts 
across Africa. In addition, the Africa Task- 
force on Coronavirus (AFTCOR) was estab- 
lished to help implement the strategy and was 
endorsed by the Bureau of the Heads of State 
and Governments of the African Union, a val- 
idation at the highest level of the continent. 
This approach helped blunt the early spread 
of COVID-19. 




Therefore, in March 2020, when several 
countries in Africa began reporting imported 
cases of COVID-19, there was clarity on the 
course of action to take. For example, as part 
of the AFTCOR, the Africa CDC rapidly sup- 
ported member states to establish diagnostics 
capacity and expanded testing capacity from 
two countries in February to more than 43 by
the end of March, through competency-based 
training of member countries at reference 
centers in Dakar, Senegal, and Johannesburg, 
South Africa. The coordinated approach en- 
sured harmony in response strategies. For ex- 
ample, the establishment of the African Med- 
ical Supply Platform helped to streamline the 
procurement of response commodities. 

The puzzle of the COVID-19 pandemic 
in Africa can partly be explained by decisive 
measures taken early to prepare the conti- 
nent. However, more data are needed to com- 
plement what is routinely collected through 
surveillance and response to understand the 
different pieces of the puzzle that contribute 
to the pattern of the pandemic in Africa. Se- 
rosurveys and the use of genomic epidemi- 
ology can help to better understand disease 
spread. Further understanding of factors that
influence viral pathogenesis and clinical spectrum
of disease, and the impacts on endemic
infections (HIV, tuberculosis, and malaria),
is needed. Efforts to understand attitudes to
COVID-19 vaccines are also a priority. 

SARS-like viruses may jump from animals to people
hundreds of thousands of times a year
Study pinpoints Asian regions that could spark the next coronavirus pandemic 


By Kai Kupferschmidt 


Only two new coronaviruses have spread
globally in the past 2 decades: SARS-CoV,
which caused an outbreak of severe
acute respiratory syndrome (SARS) in
2003, and SARS-CoV-2, the virus that
causes COVID-19. But that may just be
the tip of the iceberg of undetected infections
with related viruses emerging from bats, a new
paper claims. In a preprint published yesterday,
researchers estimate that an average of 400,000 
people are likely infected with SARS-related 
coronaviruses every year, in spillovers that never 
grow into detectable outbreaks. 

Although that number comes with big cave- 
ats, “It should be eye-opening to the entire scien- 
tific community that we don't know very much 
about the frequency of zoonotic spillover,” says
virologist Angela Rasmussen of the University 
of Saskatchewan, who was not involved in the 
work. That needs to change, she says, “because 
otherwise we grossly underestimate it.” 

The researchers, including Peter Daszak 
from the EcoHealth Alliance and Linfa Wang 
from Duke-NUS Medical School in Singapore, 
created a detailed map of the habitats of 23 bat 
species known to harbor SARS-related coro- 
naviruses, the group to which SARS-CoV and 
SARS-CoV-2 belong, and then overlaid it with 
data on where humans live to create a map of 
potential infection hot spots. They found that 
close to 500 million people live in areas where 
spillovers can occur, including northern India, 
Nepal, Myanmar, and most of Southeast Asia. 
The risk is highest in southern China, Vietnam, 
Cambodia, and on Java and other islands in In- 
donesia (see map, below). 

“This is a definitive analysis of where on the 
planet the next SARS- or COVID-like virus is 
most likely to emerge,” Daszak says. The maps
could guide efforts to reduce the likelihood of 
spillover by changing behaviors in high-risk 
communities and targeting surveillance to de- 
tect new outbreaks earlier, he says. Daszak, a vo- 
cal advocate of the hypothesis that SARS-CoV-2 
came from the wild instead of a research lab, 
says the maps could also guide efforts to find the 
virus’ natural origin. (Several studies are under- 
way or being planned to look for SARS-CoV-2 
and its relatives in Rhinolophus [horseshoe] bats 
and other animals.) 

But the researchers went one step further. 
Small surveys done before COVID-19 erupted 
have suggested some people in Southeast Asia 
harbor antibodies against SARS-related coro- 
naviruses. Combining those data with data on 
how often people encounter bats and how long 
antibodies remain in the blood, the researchers 
calculated that some 400,000 undetected human 
infections with these viruses occur each year 
across the region. 
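
To put the headline number in proportion, the estimate can be set against the roughly 500 million people the article says live in spillover-risk areas. The Python sketch below does only that division; it assumes, purely for illustration, that risk is spread evenly across that population, which the preprint's own model need not assume.

```python
# Figures quoted in the article (rounded).
estimated_infections_per_year = 400_000  # central estimate from the preprint
people_in_risk_areas = 500_000_000       # "close to 500 million people"

# Average annual infections per person, assuming evenly spread risk
# (an illustrative simplification, not the preprint's method).
annual_rate = estimated_infections_per_year / people_in_risk_areas
per_10_000 = annual_rate * 10_000
people_per_infection = people_in_risk_areas / estimated_infections_per_year

print(f"Infections per 10,000 people per year: {per_10_000:.0f}")
print(f"One infection per {people_per_infection:,.0f} people per year")
```

On these rounded figures, the estimate corresponds to about 8 infections per 10,000 people per year, or one per 1,250 people.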

Daszak says interactions with bats are much 
more common than people think: “Just living 
there means you're exposed: People are shelter- 
ing in caves, they’re digging guano out of caves, 
they’re hunting and eating bats.” The paper does 
not even address how many people work in the 
wildlife trade and may be infected indirectly 
when a bat virus infects another animal first, 
he says. 

Although 400,000 infections annually sounds
like a lot, Rasmussen says, “in a region with
likely hundreds of millions of bats and nearly
half a billion people it isn’t that many.” The
confidence interval stretches from one to more
than 35 million hidden infections per year,
however—big enough to “fly the entire population
of Rhinolophid bats through,” Rasmussen quips.

Models are only as good as the data that are fed
into them, says Vincent Munster, a virologist
at the U.S. National Institute of Allergy and
Infectious Diseases who studies coronaviruses.
The data on antibodies only include a few thousand
people, he notes, and the assays used to test for
antibodies can easily lead to false positives.

“I think if the seroprevalence estimate is way
off, the whole thing collapses,” says David Fisman,
an epidemiologist at the University of Toronto,
who calls the modeling “shaky.” The high
number of hidden infections “doesn’t ring true,”
Fisman says, because you would expect regular
spillovers to be recognized, as they are for rabies
and the Nipah virus.

But Rasmussen says many infections could
remain hidden if they are short-lived and don’t
lead to onward transmission because the viruses
are not well adapted to humans. They might not
infect enough cells—or cells of the right type—
to be transmitted to another person, or they
might not be able to escape humans’ immune
defenses. In cases when the virus does spread,
sheer chance may keep it confined to a small,
isolated community.

“A lot of the viruses are probably unable to
be transmitted from one person to another,
but I have very little doubt that there have been
illnesses due to these viruses that get misdiagnosed
or never diagnosed,” Daszak says. “A rural
farmer in Myanmar is hardly likely to go to the
clinic because they've got a bit of a cough.”

The work is part of a nascent effort to try to understand
the risk factors for viral spillover from
animals into humans, Munster says. Already, one
message is clear, he says: “I think for virtually
any zoonotic pathogen from wildlife, spillover is
more frequent than previously recognized.”

A map in a new paper shows the relative spillover risk for severe acute respiratory
syndrome-related coronaviruses. China and countries in Southeast Asia are 
potential hot spots for human infections. 

INFECTIOUS DISEASES


The Global Virome Project 
Expanded viral discovery can improve mitigation 


By Dennis Carroll, Peter Daszak, Nathan 
D. Wolfe, George F. Gao, Carlos M. Morel, 
Subhash Morzaria, Ariel Pablos-Méndez, 
Oyewale Tomori, Jonna A. K. Mazet 


Outbreaks of novel and deadly viruses
highlight global vulnerability to emerging
diseases, with many having massive
health and economic impacts. Our adaptive
toolkit—based largely on vaccines
and therapeutics—is often ineffective
because countermeasure development can be 
outpaced by the speed of novel viral emergence 
and spread. Following each outbreak, the public 
health community bemoans a lack of prescience, 
but after decades of reacting to each event with 
little focus on mitigation, we remain only mar- 
ginally better protected against the next epidem- 
ic. Our ability to mitigate disease emergence is 
undermined by our poor understanding of the 
diversity and ecology of viral threats, and of the 
drivers of their emergence. We describe a Global 
Virome Project (GVP) aimed to launch in 2018 
that will help identify the bulk of this viral threat 
and provide timely data for public health inter- 
ventions against future pandemics. 

Nearly all recent pandemics have a viral etiol- 
ogy with animal origins, and with their intrinsic 
capacity for interspecies transmission, viral zoo- 
noses are prime candidates for causing the next 
great pandemic (1, 2). However, if these viruses 
are our enemy, we do not yet know our enemy 
very well. Around 263 viruses from 25 viral 
families are known to infect humans (3) (see the 
figure), and given the rate of discovery following 
identification of the first human virus (yellow 
fever virus in 1901), it is likely many more will 
emerge in the future (4). We estimate, from anal- 
ysis of recent viral discovery data (5), that ~1.67 
million yet-to-be-discovered viral species from 
key zoonotic viral families exist in mammal and 
bird hosts—the most important reservoirs for 
viral zoonoses (supplementary text). 

By analyzing all known viral-host relation- 
ships (3, 6), the history of viral zoonoses (7), and 
patterns of viral emergence (1), we can reason- 
ably expect that between 631,000 and 827,000 of 
these unknown viruses have zoonotic potential 
(supplementary text). We have no readily avail- 
able technological countermeasures to these 
as-yet-undiscovered viruses. Furthermore, the 
rate of zoonotic viral spillover into people is ac- 
celerating, mirroring the expansion of our global 
footprint and travel networks (1, 8), leading to a 
nonlinear rise in pandemic risk and an exponen- 
tial growth in their economic impacts (8). 
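
The share of unknown viruses expected to have zoonotic potential follows directly from the estimates above. The Python sketch below computes that range from the rounded figures in the text; the variable names are illustrative.

```python
# Estimates quoted in the text (rounded).
unknown_viruses = 1_670_000  # ~1.67 million undiscovered viral species
zoonotic_low = 631_000       # lower estimate with zoonotic potential
zoonotic_high = 827_000      # upper estimate with zoonotic potential

# Share of the unknown viruses expected to have zoonotic potential.
share_low = zoonotic_low / unknown_viruses
share_high = zoonotic_high / unknown_viruses

print(f"Zoonotic share of unknown viruses: {share_low:.0%} to {share_high:.0%}")
```

On these figures, somewhere between roughly 38% and 50% of the undiscovered viruses would be expected to have zoonotic potential.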

PROMISING PILOT, CHALLENGING SCALE
Since 2009, the U.S. Agency for International De- 
velopment (USAID) has conducted a large-scale 
pilot project, spanning more than 35 countries 
over 8 years at a cost of around $170 million, to 
evaluate the feasibility of preemptively mitigat- 
ing pandemic threats. Other previous studies 
had begun to conduct targeted viral discovery in 
wildlife (9), and develop mitigation strategies for 
the emergence of avian flu, for example. How- 
ever, the USAID Emerging Pandemic Threats 
(EPT) PREDICT project is the first global-scale 
coordinated program designed to conduct viral 
discovery in wildlife reservoir hosts, and char- 
acterize ecological and socioeconomic factors 
that drive their risk of spillover, to mitigate their 
emergence in people (10). 

Working with local partners and govern- 
ments, wildlife and domestic animals and at- 
risk human populations in geographic hotspots 
of disease emergence (1) are sampled, and vi- 
ral discovery conducted. A strategy to identify 
which novel viruses are most at risk of spillover 
has been developed (11), and further work is 
conducted on these to characterize them prior 
to, or in the early stages of, spillover. Metadata 
on the ecology of wildlife-livestock-human 
transmission interfaces, and on human behav- 
ioral patterns in communities, are concurrently 
analyzed so that strategies to reduce spillover 
can be developed (supplementary text). To date, 
EPT PREDICT has discovered more than 1000 
viruses from viral families that contain zoono- 
ses, including viruses involved in recent out- 
breaks (12), and others of ongoing public health 
concern (13). The focus of EPT PREDICT on ca- 
pacity building, infrastructure support, training, 
and epidemiological analysis differs substantial- 
ly from the GVP’s emphasis on large-scale sam- 
pling and viral discovery. However, to discover 
the bulk of the projected remaining 1.67 mil- 
lion unknown viruses in animal reservoirs and 
characterize the majority of 631,000 to 827,000 
viruses of highest zoonotic potential requires 
overcoming some challenges of scale. 

The first challenge is cost. To estimate this, 
we analyzed data on field sampling and labora- 
tory expenditures for viral discovery from (5, 
10), and estimates of unknown viral diversity 
in mammalian and avian hosts (supplementa- 
ry text). We estimate that discovery of all viral 
threats and characterization of their risk for 
spillover, using currently available technologies 
and protocols, would be extremely costly at over 
$7 billion (supplementary text). However, previous
work shows that viral discovery rates are
vastly higher in the early stages of a sampling 
program, and that discovering the last few, rare, 
viruses is extremely costly and time-consuming 
owing to the number of samples required to 
find them (5) (supplementary text). We used 
data on rates of viral discovery (5) to estimate 
that the substantial majority of the viral diver- 
sity from our target zoonotic reservoirs could 
be discovered, characterized, and assessed for 
viral ecology within a 10-year time frame for 
~$1.2 billion (16% of total costs for 71% of the 
virome, considering some fixed costs) (fig. S1).
Those viruses remaining undiscovered will, by 
the nature of sampling bias toward more com- 
mon host species, represent the rarest viruses 
with least opportunity for spillover, and there- 
fore reduced public health risk. Their discovery 
would require exponentially greater sampling 
effort and funding that could be better spent 
on countermeasures for the more likely threats 
(supplementary text). 
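
The cost trade-off described above can be illustrated with the percentages quoted in the text. The Python sketch below back-calculates the total cost implied by "16% of total costs" and compares the average cost per percentage point of the virome before and after the 71% mark; it is a rough reading of the rounded figures, not a reproduction of the supplementary cost model.

```python
# Figures quoted in the text (rounded).
partial_cost_usd = 1.2e9    # ~$1.2 billion over 10 years
share_of_total_cost = 0.16  # stated as 16% of total costs
share_of_virome = 0.71      # stated as 71% of the virome

# Total cost implied by the two figures above.
implied_total_cost = partial_cost_usd / share_of_total_cost

# Cost attributable to the remaining 29% of the virome.
remaining_cost = implied_total_cost - partial_cost_usd

# Average cost per percentage point of virome, first 71% vs. last 29%.
cost_per_point_first = partial_cost_usd / (share_of_virome * 100)
cost_per_point_last = remaining_cost / ((1 - share_of_virome) * 100)

print(f"Implied total cost: ${implied_total_cost / 1e9:.1f} billion")
print(f"Cost per point, first 71%: ${cost_per_point_first / 1e6:.0f} million")
print(f"Cost per point, last 29%: ${cost_per_point_last / 1e6:.0f} million")
```

The implied total of about $7.5 billion is consistent with the "over $7 billion" estimate, and the last 29% of the virome costs, on average, more than ten times as much per percentage point as the first 71%.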

Stakeholders from Asia, Africa, the Ameri- 
cas, and Europe, spanning industry, academia, 
intergovernmental agencies, nongovernmental 
organizations (NGOs), and the private sector, 
began meeting in 2016 to design a framework 
for the governance, management, technical op- 
eration, and scope of the GVP. Key efforts in- 
clude developing finance streams; establishing a 
transparent, equitable implementation strategy; 
designing data- and sample-sharing protocols; 
developing laboratory and metadata platforms; 
targeting of host taxa and sampling sites; analyz- 
ing return on investment; forming collaborative 
field and laboratory networks; developing risk 
characterization frameworks for viruses discov- 
ered; designing a strategy to assess and mitigate 
risk behaviors that facilitate viral emergence; 
and planning in-country capacity building for 
sustainable threat mitigation. Funding has been 
identified to support an initial administrative 
hub, and fieldwork is planned to begin in the first 
two countries, China and Thailand, during 2018. 

With outputs intended to serve the global 
public good, the GVP is developing a transpar- 
ent and equitable strategy to share data, viral 
samples, and their likely products, including 
benefits derived from future development of 
medical countermeasures. These build on the 
Nagoya Protocol to the Convention on Biolog- 
ical Diversity and the Pandemic Influenza Pre- 
paredness Framework, negotiated by the World 
Health Organization (WHO). The international 
collaborative nature and global ownership of 
the project should help leverage funding from 
diverse international donors, including government
agencies focused on national virome projects
or on international development projects in
other countries, and private-sector philanthropic 
donors focused on technology and big science. 

The diversity of tasks required to conduct 
the GVP should reduce the potential for it to 
divert funds from current public health pro- 
grams. For example, discrete work streams on 
targeted sampling of wildlife, on bioinformatics, 
and on behavioral risk analysis fall within the 
focus of current scientific research programs 
in a range of donor agencies. Governments and 
corporations with specific remits and geograph- 
ic responsibilities have been approached to fi- 
nance subprojects relevant to their sectors (e.g., 
capacity development, surveillance of specific 
taxa, geographically focused activities, medical 
countermeasure development, training, surveil- 
lance, and technological platforms). In addition, 
leaders in China and a number of countries have 
begun developing national virome projects as 
part of the GVP, leveraging current research 
funding to include GVP sites. 

Technological challenges include safe field 
sampling in remote locations and cost-effective 
laboratory platforms that can be standardized in 
low-income settings. To achieve these goals, ex- 
isting national, regional, and international net- 
works will need to be enhanced and expanded 
within standardized sampling and testing frame- 
works. Existing networks of field biologists from 
environment ministries, academic institutions, 
and conservation and health NGOs may assist 
in surveillance. National science and technology 
agencies, regional One Health platforms, trans- 
boundary disease surveillance networks, Insti- 
tut Pasteur laboratories, WHO, United Nations 
Food and Agriculture Organization, and the
World Organization for Animal Health collabo- 
rating, and reference centers and viral discovery 
laboratories, including USAID EPT PREDICT, 
are currently involved in planning these activ- 
ities around a decade-long sampling and test- 
ing time frame. A monitoring and evaluation 
strategy is being developed based on analysis 
of viral discovery rates against predicted viral 
diversity, to identify when to halt surveillance 
and testing as the GVP progresses. Stakeholders 
will also tackle the challenge of how to decide 
when enough potentially dangerous viruses 
have been discovered in a host species or region 
to call for action to reduce underlying drivers of 
emergence (e.g., hunting and trading of a wild- 
life reservoir). Laboratory platforms developed 
by USAID EPT PREDICT have proven capacity 
to identify novel viruses and are relatively inex- 
pensive and reliable, being based on polymerase 
chain reaction using degenerate primers that 
target a range of viral families of known zoonot- 
ic potential. However, scaling up to a full global 
virome project will require discovery of three or- 
ders of magnitude more viruses in a similar time 
frame. Technological solutions will be needed to
increase the speed and efficiency, and reduce the 
cost, of sequence generation. These will likely 
include next-generation sequencing and other 
unbiased approaches to identify evolutionarily 
distinct viral clades. 
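
The "three orders of magnitude" figure can be checked against numbers given earlier in the article: more than 1000 viruses discovered by EPT PREDICT versus a projected 1.67 million unknown viruses. The Python sketch below takes the base-10 logarithm of that ratio, treating "more than 1000" as exactly 1000 for illustration.

```python
import math

# Figures quoted in the article (rounded).
discovered_by_predict = 1_000  # "more than 1000" viruses; lower bound used here
projected_unknown = 1_670_000  # ~1.67 million unknown viruses

# Ratio between the full project and the pilot, and its order of magnitude.
scale_up_factor = projected_unknown / discovered_by_predict
orders_of_magnitude = math.log10(scale_up_factor)

print(f"Scale-up factor: {scale_up_factor:,.0f}x")
print(f"Orders of magnitude: {orders_of_magnitude:.1f}")
```

The ratio works out to a little over three orders of magnitude, in line with the text.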

A key challenge is how to assess the poten- 
tial for novel viruses to infect people or become 
pandemic (14). The EPT PREDICT project 
(11) and others (2, 6) have developed prelimi- 
nary zoonotic risk characterization frameworks 
based on viral and host traits and the ecological 
and demographic characteristics of the sampling 
site. These approaches will be used in the GVP 
to triage novel viruses for further characteriza- 
tion to assess their zoonotic capacity (supple- 
mentary text). In vitro receptor binding analyses 
coupled with in vivo models have proven use- 
ful in this capacity for some viral families [e.g., 
coronaviruses (13)]. Although this is not yet 
feasible for all potentially zoonotic viral clades,
applying these techniques to a larger viral data
set as the GVP progresses will allow validation
of risk frameworks and may increase our capacity
to predict zoonotic potential. However, advancing
these goals will require new collaboration
among lab virologists, epidemiologists, and
modelers, innovative approaches to field-testing
the boundaries of virus-host relationships, and
support across agencies that often fund separate
virology, public health, evolutionary biology,
and biodiversity modeling initiatives.

GVP targeting strategy
The project will capitalize on economies of scale
in viral testing, systematically sampling
mammals and birds to identify currently unknown,
potentially zoonotic viruses that they carry.
111 viral families have been discovered globally to date.


INVESTMENTS, RETURNS 

The cost of the GVP represents a sizable in- 
vestment and, even if a large number of poten- 
tial zoonoses are discovered, only a minority is 
likely to have the potential to cause large-scale 
outbreaks and mortality in people (1, 2, 7). 
However, given the high cost of single epidemic 
events, data produced by the GVP may provide 
substantial return on investment by enhancing 
diagnostic capacity in the early stages of a new 
disease outbreak or by rapidly identifying spill- 
over hosts, for example. Recent analysis of the 
exponentially rising economic damages from 
increasing rates of zoonotic disease emergence 
suggests that strategies to mitigate pandem- 
ics would provide a 10:1 return on investment 
(1, 8). Even small reductions in the estimated 
costs of a future influenza pandemic (hundreds 
of billions of dollars) or of the previous SARS 
(severe acute respiratory syndrome) epidemic 
($10 billion to $30 billion) could be substantial. 
The goal of the GVP is to improve efficiency in 
the face of these increasing viral spillover rates 
by enhancing (not replacing) current pandem- 
ic surveillance, prevention, and control strat- 
egies. If we were to invest only in surveillance 
for known pathogens (our current business-as- 
usual strategy), our calculations suggest we 
would protect ourselves against less than 0.1% 
of those viruses that could conceivably infect 
people, even using the lower bounds of our uncertainty
for our viral estimates (i.e., 263 viruses
known from humans out of 263,824 unknown 
potential zoonoses; supplementary text). 
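
The "less than 0.1%" figure follows from the two numbers in the parenthetical above. The Python sketch below reproduces that division; the variable names are illustrative.

```python
# Figures quoted in the text.
known_human_viruses = 263                   # viruses known to infect humans
unknown_potential_zoonoses_lower = 263_824  # lower-bound estimate in the text

# Share of potential human-infecting viruses covered by known-pathogen surveillance.
coverage = known_human_viruses / unknown_potential_zoonoses_lower

print(f"Coverage of known-pathogen surveillance: {coverage:.4%}")
```

The result is just under 0.1%, matching the claim in the text.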

The potential benefits of the GVP may be 
enhanced to maximize public health benefits 
(supplementary text) by (i) optimizing sampling 
to target species most likely to harbor “missing 
zoonoses” (6), or to target emerging disease 
hotspot regions most likely to propagate major 
disease outbreaks (1); (ii) using human and live- 
stock syndromic surveillance to identify regions 
for wildlife sampling proximal to repeated out- 
breaks of severe influenza-like-illnesses, fevers 
of unknown origin, encephalitides, livestock 
“abortion storms,” and other potential emerging
disease events; (iii) initially targeting RNA
viruses, which caused 94% of the zoonoses documented
from 1990 to 2010; and (iv) fostering
economies of scale and adoption of technological
innovation as the GVP ramps up. This
includes use of laboratories that can facilitate 
regional sample processing, development of cen- 
tralized bioinformatics platforms, and improved 
logistics for sample collection and transport. We 
also expect the cost of testing and sequencing 
to decrease as technology is enhanced, much as 
the development of next-generation sequencing 
reduced genetic sequencing costs by up to four 
orders of magnitude in a decade. 

The accelerated pace of viral discovery under 
the GVP will make the virological, phylogenet- 
ic, and modeling approaches used in pandemic 
preparedness more data-rich, and likely more 
effective. For example, having the sequence data 
for thousands, rather than a few, viruses from a 
single family could extend vaccine, therapeutic, 
or drug development to a wider range of targets, 
leading to broad-spectrum vaccines and other 
countermeasures. Identification of novel virus- 
es may be useful to programs like the Coalition 
for Epidemic Preparedness Innovations (CEPI) 
in assessing the breadth of action of candidate 
vaccines and therapeutics, and in expanding 
their efficacy. More broad-scale prevention ap- 
proaches could provide immediate return on 
investment prior to vaccine and countermeasure 
development, which would require substantial 
investment and time. For example, metadata on 
viral reservoir host identity, geography, season- 
ality, proximity to people, and drivers of emer- 
gence will refine our mechanistic understanding 
of spillover and enhance published models of 
emerging infectious diseases risk (1, 6). Iden- 
tification of novel viruses in hunted, traded, or 
farmed wildlife species could be used to enhance 
biosecurity in markets and farming systems, reducing
public health risk, increasing food security,
and assisting in conservation of hunted
species. The presence of hosts harboring high- 
risk novel viruses in proximity to human popu- 
lations may allow targeted follow-up to examine 
evidence of spillover and design intervention 
strategies (supplementary text). Ultimately, the 
benefits of the GVP may include enhancing our 
understanding of viral biology, such as drivers 
of competition or cooperation among viruses 
within hosts, genomic underpinnings of host- 
virus coevolution, processes underlying deep 
evolution of viral clades, and the identification 
of novel viral groups (15). 

The regions targeted by the GVP are largely 
highly biodiverse, rapidly developing countries 
in the tropics, which often have low capacity 
to deal with public health crises (1). The ex- 
panded laboratory capacity, field sampling, and 
data generation intrinsic to the GVP goals will 
therefore improve capacity to detect, diagnose, 
and discover viruses in vulnerable populations 
within regions most critical to preventing future 
pandemics. This enhanced capacity may also 
help improve diagnosis and control for endemic 
diseases, as well as the portion of the virome that 
remains undiscovered. 

The Human Genome Project in the 1980s 
catalyzed technological innovation that dramat- 
ically shortened the time and cost for its com- 
pletion, and ushered in the era of personalized 
genomics and precision medicine. The GVP will 
likely accelerate development of pathogen dis- 
covery technology, diagnostic tests, and science- 
based mitigation strategies, which may also 
provide unexpected benefits. Like the Human 
Genome Project, the GVP will provide a wealth 
of publicly accessible data, potentially leading to
discoveries that are hard to anticipate, perhaps
viruses that cause cancers and chronic physio- 
logical, mental health, or behavioral disorders. 
It will provide orders-of-magnitude more in- 
formation about future threats to global health 
and biosecurity, improve our ability to identify 
vulnerable populations, and enable us to more 
precisely target mitigation and control measures 
to foster an era of global pandemic prevention.

COVID-19 testing: One size does not fit all


To control the pandemic, testing should be considered a public health tool 


By Michael J. Mina and
Kristian G. Andersen


Tests for detecting severe acute respiratory
syndrome coronavirus 2 (SARS-CoV-2)
were developed within days of
the release of the virus genome (1). Mul- 
tiple countries have been successful at 
controlling SARS-CoV-2 transmission 
by investing in large-scale testing capacity (2). 
Most testing has focused on quantitative poly- 
merase chain reaction (qPCR) assays, which 
are capable of detecting minute amounts of 
viral RNA. Although powerful, these molecular
tools cannot be scaled to meet demands for
more extensive public health testing. To combat 
COVID-19, the “one-size-fits-all” approach that 
has dominated and confused decision-making 
with regard to testing and the evaluation of 
tests is unsuitable: Diagnostics, screening, and 
surveillance serve different purposes, demand 
distinct strategies, and require separate approv- 
al mechanisms. By supporting the innovation, 
approval, manufacturing, and distribution of 
simpler and cheaper screening and surveillance 
tools, it will be possible to more effectively limit 
the spread of COVID-19 and respond to future 
pandemics. 


Originally published 8 January 2021 


Many types of tests are available for 
COVID-19 for clinical and public health use 
(see the figure). Testing can be performed in a 
central laboratory, at the point of care (POC), or 
in the community at the workplace, school, or 
home. COVID-19 testing begins with specimen 
collection. For medical use, a nasopharyngeal 
swab collected by a health care professional has 
been used for detection of virus infections. De- 
mands on testing throughput for COVID-19, 
however, have driven new collection approach- 
es, including saliva and less invasive nasal swabs. 
COVID-19 tests include molecular tests such as 
qPCR, isothermal amplification, and CRISPR,
as well as antigen tests that detect SARS-CoV-2
proteins directly. Although rapid antigen tests
have lower analytical sensitivity (i.e., require
greater amounts of virus material to turn posi- 
tive) than qPCR-based tests, their ability to de- 
tect infectious individuals with culturable virus 
is as high as for qPCR (3). Specificity (i.e., cor- 
rectly identifying those not infected with SARS- 
CoV-2) of antigen tests achieves comparable 
results to molecular tests (4). 

Diagnostic testing for COVID-19 focuses on 
accurately identifying patients who are infected 
with SARS-CoV-2 to establish the presence or 
absence of disease and is performed on symp- 
tomatic patients or asymptomatic individuals 
who are at high risk of infection. This type of 
testing requires assays that are highly sensitive, 
so as to not miss COVID-19 patients (false nega- 
tives), and specific, so as to not wrongly diagnose 
SARS-CoV-2-negative individuals as having 
COVID-19 (false positives). These tests are typ- 
ically performed by centralized high-complexity 
laboratories with specialized equipment using 
qPCR assays, with results that can be report- 
ed within 12 to 48 hours. Major bottlenecks in 
testing, however, have led to turnaround times 
exceeding 5 to 10 days in some regions, making 
such tests useless to prevent transmission. 

POC diagnostic testing at medical facilities 
can be qPCR assays, isothermal amplification, 
or antigen-based (4). These POC tests often 
require instruments that run a limited number 
of tests and can return results in under an hour. 
The need for an instrument limits the number 
of tests that can be performed and where they 
can be used. However, newer antigen tests are 
becoming available that do not require instru- 
ments or skilled operators, potentially allowing 
for much more distributed POC testing. 

Surveillance testing of populations can be 
used both as a tool for understanding historical 
exposures and as a measure of ongoing commu- 
nity transmission. For the former, serological 
testing of individuals for the presence of SARS- 
CoV-2-specific antibodies is used to identify 
those previously infected. For the latter, surveil- 
lance testing can be an effective way to monitor 
real-time SARS-CoV-2 spread in communities. 
One promising method is wastewater surveil- 
lance, which has been used to assess community 
transmission of poliovirus (5) and has shown
potential for COVID-19 (6). qPCR testing of
wastewater is used to detect SARS-CoV-2, and 
frequency dynamics of viral genetic material 
indicate COVID-19 infections in a community. 
Surveillance can also be performed from swab 
or saliva samples taken directly from individ- 
uals, and, in populations with low COVID-19 
prevalence, pooling can be used to increase ca- 
pacity and lower cost. 
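
The capacity gain from pooling at low prevalence can be sketched with a short calculation. The example below assumes classic two-stage (Dorfman) pooling, in which each pool is tested once and every member of a positive pool is retested; the article does not specify a pooling scheme. It also assumes a perfectly accurate test and independent infections, and the function names are hypothetical.

```python
def expected_tests_per_person(prevalence: float, pool_size: int) -> float:
    """Expected number of tests per person under two-stage (Dorfman) pooling.

    Assumptions (not from the article): a perfectly accurate test and
    independent infections. Each pool is tested once; if the pool is
    positive, every member is retested individually.
    """
    if not 0.0 <= prevalence <= 1.0:
        raise ValueError("prevalence must be in [0, 1]")
    if pool_size < 2:
        return 1.0  # no pooling: one test per person
    p_pool_positive = 1.0 - (1.0 - prevalence) ** pool_size
    return 1.0 / pool_size + p_pool_positive


def best_pool_size(prevalence: float, max_size: int = 50) -> int:
    """Pool size in 1..max_size that minimizes expected tests per person."""
    return min(
        range(1, max_size + 1),
        key=lambda k: expected_tests_per_person(prevalence, k),
    )
```

Under these assumptions, pooling saves tests only while most pools come back negative; as prevalence rises, the retesting burden grows and individual testing becomes the cheaper option.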

For surveillance testing, the goal is not identi- 
fication of every case but rather the collection of 
data from representative samples that accurately 
measure prevalence and serve to inform public 
health policy and resource allocation. Because 
the focus is on extrapolations to the population 
and not the individual, tests with known devi- 
ations from 100% sensitivity and specificity are 
still appropriate when the variance can be statis- 
tically corrected (7). To be most effective, results 
should include reported qPCR cycle thresholds, 
which provide an estimate of viral load (7), to model
epidemic trajectory and allow for real-time
evaluation of mitigation programs (8), including 
once vaccination programs have begun. 
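
One standard way to correct a survey result for imperfect test accuracy is the Rogan-Gladen estimator, sketched below. It is offered as an illustration of the kind of statistical correction described above, not necessarily the method used in the cited work; the function name is hypothetical, and sensitivity and specificity are assumed to be known exactly.

```python
def corrected_prevalence(apparent: float, sensitivity: float, specificity: float) -> float:
    """Rogan-Gladen estimate of true prevalence.

    apparent    -- observed fraction of positive tests in the sample
    sensitivity -- P(test positive | infected), assumed known
    specificity -- P(test negative | not infected), assumed known
    """
    denom = sensitivity + specificity - 1.0
    if denom <= 0.0:
        raise ValueError("test must perform better than chance (Se + Sp > 1)")
    estimate = (apparent + specificity - 1.0) / denom
    # Sampling noise can push the raw estimate outside [0, 1]; clamp it.
    return min(1.0, max(0.0, estimate))
```

For example, if 5% of a population is truly infected and the test has 80% sensitivity and 99% specificity, the expected positive fraction is 0.05 × 0.80 + 0.95 × 0.01 = 4.95%, and the estimator recovers 5% from that observed value.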

Screening testing of asymptomatic individ- 
uals to detect people who are likely infectious 
has been critically underused yet is one of the 
most promising tools to combat the COVID-19 
pandemic (9). Infection with SARS-CoV-2 does 
not lead to symptoms in ~20 to 40% of cases, 
and symptomatic disease is preceded by a pre- 
symptomatic incubation period (10). However, 
asymptomatic and presymptomatic cases are key 
contributors to virus spread, complicating our 
ability to break transmission chains (10). 

Entry screening to detect infectious indi- 
viduals before accessing facilities (e.g., nursing 
homes, restaurants, and airports), along with 
symptom screening and temperature checks, 
can be beneficial, particularly in high-risk fa- 
cilities such as skilled nursing facilities. When 
used strategically, entry-screening measures can 
be effective at suppressing transmission. Entry 
screening requires testing that provides rap- 
id results—ideally within 15 min—to be most 
effective. The required sensitivity and speci- 
ficity of entry-screening tests are, like all tests, 
context dependent. Entry-screening tests for 
a nursing home, for example, must be highly 
sensitive because the consequences of bring- 
ing SARS-CoV-2 into a nursing home can be 
devastating. Such tests must also be highly spe- 
cific because the consequences of grouping a 
false-positive person with COVID-19-positive 
individuals could be deadly. Conversely, because 
children have substantially reduced mortality 
from COVID-19, entry screening into schools 
might require greater compromise that balanc- 
es resources and sensitivity to test as many in- 
dividuals as possible with a need to minimize 
disruptive false positives. Key to use of tests for
entrance screening is that a negative test alone
should not be considered sufficient to enter— 
that should be based on satisfying other require- 
ments, including masks and physical distancing. 
Conversely, a positive test should be sufficient to 
bar entry in most settings. 

Public health screening is potentially the 
most powerful form of COVID-19 testing, 
aimed at outbreak suppression through maxi- 
mizing detection of infectious individuals. This 
type of screening entails frequent serial testing 
of large fractions of the population, through self- 
administered at-home rapid tests, or in the com- 
munity at high-contact settings, such as schools 
and workplaces (9). Public health screening can 
achieve herd effects by stopping onward spread 
through detection of asymptomatic or presymptomatic
cases (fig. S1).

Notably, not every transmission chain needs 
to be severed to achieve herd effects. Mathemat- 
ical models that incorporate relevant variation 
in viral loads and test accuracy suggest that with 
frequent testing of a large fraction of a popula- 
tion, a sufficient number of cases could be de- 
tected to create herd effects (11). For example, 
Slovakia undertook public health screening to 
address COVID-19 (12): During a 2-week peri- 
od, ~80% of the population was screened using 
rapid antigen tests. With 50,000 cases identified, 
and combined with other public health measures, the
screening reduced incidence by 82% within 2 weeks (12).
An important feature of large-scale public health 
screening is that centrally controlled reporting 
and contact tracing programs are not essential 
to induce herd effects as they are for surveillance 
testing. In a robust public health screening pro- 
gram, sufficient numbers of people are routinely 
testing themselves, such that contact tracing is 
subsumed by the screening program (11). 

Similar to home pregnancy tests, screening 
tests should be easy to obtain and administer, 
fast, and cheap. Like diagnostic tests, these tests 
must produce very low false-positive rates. If 
a screening test does not achieve high-enough 
specificity (e.g., >99.9%), screening programs 
can be paired with secondary confirmatory 
testing. Unlike diagnostic tests, however, the 
sensitivity of screening tests should not be de- 
termined based on their ability to diagnose pa- 
tients but rather by their ability to accurately 
identify people who are most at risk of trans- 
mitting SARS-CoV-2. Such individuals tend to 
have higher viral loads (13), which makes the 
virus easier to detect (14). A focus on identify- 
ing infectious people means that frequency and 
abundance of tests should be prioritized above 
achieving high analytical sensitivity (11). In- 
deed, loss in sensitivity of individual tests, with- 
in reason, can be compensated for by frequency 
of testing and wider dissemination of tests (9). 
In addition, public health messaging should ensure
appropriate expectations of screening, particularly
around sensitivity and specificity so that false
negatives and false positives do not
erode public trust.
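
Two back-of-envelope calculations help make the trade-offs in this paragraph concrete: how specificity drives the share of positive results that are true positives at low prevalence, and how repeated testing can offset lower per-test sensitivity. The sketch below is illustrative only. The function names are hypothetical, and the second function assumes successive test results are independent, which is a simplification because real results depend on the time course of viral load.

```python
def positive_predictive_value(prevalence: float, sensitivity: float, specificity: float) -> float:
    """Fraction of positive results that are true positives."""
    true_pos = prevalence * sensitivity
    false_pos = (1.0 - prevalence) * (1.0 - specificity)
    if true_pos + false_pos == 0.0:
        return 0.0  # no positives at all
    return true_pos / (true_pos + false_pos)


def detection_probability(sensitivity: float, n_tests: int) -> float:
    """Chance that at least one of n tests is positive for an infectious person.

    Simplifying assumption: each test is an independent trial with the
    same per-test sensitivity.
    """
    if n_tests < 0:
        raise ValueError("n_tests must be non-negative")
    return 1.0 - (1.0 - sensitivity) ** n_tests
```

With 1% prevalence and 90% sensitivity, raising specificity from 98% to 99.9% lifts the predictive value of a positive result from about 31% to about 90% under this model, which is why confirmatory testing matters when specificity falls short. Likewise, three independent tests at 70% sensitivity would jointly detect about 97% of infectious individuals.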

COVID-19 testing strategies

Testing for SARS-CoV-2 can be for personal or population health. Collection can
be from symptomatic or asymptomatic individuals, as well as from wastewater
and swabs of surfaces. The tests may be performed in central laboratories, at the
POC, or using rapid tests. Attributes of tests differ according to application.

Tests for public health screening
require rapid, decentralized
solutions that can be scaled for
frequent screening of large numbers
of asymptomatic individuals.
Lateral-flow antigen tests and
upcoming paper-based synthetic [...]
cheap, can be self-administered,
and do not require machines to
run and return results. The Abbott
BinaxNOW rapid antigen test,
which recently received an Emergency
Use Authorization (EUA) in
the United States as a diagnostic device,
also comes with a smartphone
app, allowing self-reporting of COVID-19 status
that could be used instead of centralized reporting
by public health agencies. Critically, despite
being shown to be highly effective at detecting
infectious individuals (14), very few of these
tests are currently approved for screening of asymptomatic
individuals, substantially limiting
their utility. If such tests were made available
direct to consumer (priced to allow equitable
access) or produced and provided free of charge
by governments, individuals could obtain their
COVID-19 status at their own choosing and
without complex medical decisions.

Testing is a central pillar of clinical and public
health response to global health emergencies,
including the COVID-19 pandemic. Nearly all
testing modalities have a role, and the one-size-fits-all
approach to testing by many Western
countries has failed. Many lower- and middle-income
countries, including Senegal, Vietnam,
and Ghana, have fared far better in their
COVID-19 response, often using strong testing [...]
focused on qPCR-based clinical diagnostics not
only slows the development and deployment
of new surveillance and screening tests but also
confuses the picture of what metrics effective
public health tools should achieve. Testing to diagnose
a patient with COVID-19 is fundamentally
different from testing a person
to prevent onward transmission.
Regulatory pathways should be
modified to incorporate these differences
so that public health and
screening tests are appropriately
evaluated. It is necessary to be innovative
and produce, distribute,
and continuously improve the tests
that exist to save lives and gain control
of the COVID-19 pandemic.

Antimicrobial resistance:
A global challenge


In 1945, Alexander Fleming shared the Nobel Prize in Physiology or Medicine
in recognition of his part in the discovery of penicillin. In his Nobel Lecture, he warned that “it
is not difficult to make microbes resistant to penicillin,” and that people may die as a result of bacteria 
becoming resistant to antibiotics in the future. Fleming’s prediction has come true. Estimates suggest 
that around 25,000 people in Europe (http://bit.ly/1hi4oR7) and 23,000 people in the United States 
(https://www.cdc.gov/drugresistance/pdf/ar-threats-2013-508.pdf) die each year from infections 
caused by bacteria that are resistant to antibiotics. Meanwhile, in Asia infections caused by resistant 
bacteria kill one child every five minutes. The problem of antimicrobial resistance can only be solved 
through concerted global action. Antimicrobials include antibiotics to combat bacterial infections as 
well as treatments for infections caused by viruses and fungi. The greatest threat is the rapid growth of 
resistance to our current armamentarium of antibiotics. 

The development of antimicrobial resistance is due to the natural Darwinian process of evolution. 
When bacterial cells are exposed to antibiotics, those cells without natural resistance to the drugs will
be killed; those with resistance will thrive in an environment of reduced competition. Given repeated
antibiotic exposure and time, any bacterial species will develop some form of resistance. Society’s mis- 
use of antibiotics over the last century has exacerbated and expedited this process, and turned resistance 
into a current and grave threat to modern medicine. Without antibiotics, the infection-related risks of 
procedures—from surgery to organ transplants—will substantially increase mortality rates. Treating 
cancer patients on chemotherapy with antibiotics to prevent infection will become impossible. 

The economic cost of allowing antimicrobial resistance to continue is likely to be extreme. The costs 
associated with antimicrobial resistance are already large, and will grow exponentially if the problem is 
not tackled. Costs will spiral as we sustain longer hospital admissions and resort to the few newer, more 
expensive antibiotics. Annual health care costs and productivity losses attributable to antimicrobial 
resistance are difficult to calculate and estimates vary considerably, yet all the estimates are large. One 
estimate suggests that the total societal and health care cost in Europe is €1.5 billion, and another estimate
suggests that the total cost in the United States is $55 billion (http://bit.ly/1hi4oR7;
https://www.cdc.gov/drugresistance/pdf/ar-threats-2013-508.pdf).

Immediate and coordinated global action is required to preserve the effectiveness of the current 
antimicrobial armoury (often referred to as “stewardship”) and to incentivise the development of new 
antimicrobials. Preservation of antibiotic effectiveness begins at the individual level. Throughout so- 
ciety, assiduous hygiene practices need to be reinforced. The effect of regular handwashing in health 
care settings on incidence rates of infections caused by resistant bacteria is well-documented
(http://bit.ly/1hi7PXS). In the community, improved sanitation and hygiene practices are likely to reduce both
the burden of infection (and hence the use of antibiotics) and the spread of resistant organisms.

Individual action is also required on the part of prescribers. A complex web of influence drives 
prescribing behavior, including demands from patients, competition from alternative systems of health 
care, and in some countries, financial incentives to prescribe. To reduce the likelihood of the develop- 
ment of resistance, we must ensure that prescribers offer the right antibiotic, at the right dose, for the 
right amount of time. This includes narrowing the spectrum of antibiotic therapy in a timely manner 
as microbiological evidence of the causative agent becomes available. This could be aided through the 
development of rapid and cheap point-of-care diagnostics, which may help to reduce the incidence of 
initial prescriptions for broad-spectrum antibiotics. 

Governments around the world need to tackle the gross misuse of antibiotics. For example, most 
of the 100,000 to 200,000 metric tons of antibiotics manufactured every year is used in the agricultur- 
al, aquacultural, and veterinary sectors (http://bit.ly/1hi9igO). A pricing paradox exists in farming in 
which antibiotics, a scarce natural resource, cost less than implementation of more effective hygiene 
practices. Indeed, antibiotics are used in many countries to promote animal growth and hence increase 
meat yield, rather than for the direct treatment of infections. Reduction of antibiotic use in animal rear- 
ing must be achieved while maintaining the security of the food supply. We need to better understand 
the link between animal antibiotic use and human health, and the costs of inaction to society, the econ- 
omy, and our health. International collaboration on the introduction, surveillance, and enforcement of 
guidelines regarding the appropriate use of antibiotics could be fostered through a resolution passed by 
the Food and Agriculture Organisation of the United Nations (www.fao.org/home/en). 

In many countries, an over-the-counter supply of antibiotics without a prescription is the norm. This 
greatly reduces control of the antibiotic supply, as well as our ability to monitor the use of antibiotics 
and the development of resistance. However, in countries where access to the advice of qualified med- 
ical practitioners is limited, such supplies probably save many lives. There is a balance to be struck be- 
tween limiting the availability of antibiotics and ensuring timely access to treatment for those who need 
them. Such a balance is difficult to strike in countries with robust health care systems, but is exponen- 
tially more difficult in those with limited health facilities. The increasing pattern of resistance shown 
by the Enterobacteriaceae, the leading cause of newborn sepsis in developing countries, demonstrates 
that resistance is a serious threat in these countries. Concerted global action is required to tackle this 
problem and also to ensure equity of access to effective treatments. Above all, it is important that the 
needs of all countries, regardless of income or geographical location, are considered as plans to tackle 
antimicrobial resistance are developed. 

To this end, the World Health Organization (WHO) executive board adopted a resolution on anti- 
microbial resistance in January of this year. If passed by member states at the World Health Assembly 
next week, the resolution will ask the WHO to develop a global action plan to tackle this major public 
health issue. In addition, passage of the resolution will mean that the WHO will support individual 
countries in developing their own strategies for dealing with antimicrobial resistance. This resolution is 
an important step toward tackling antimicrobial resistance worldwide. 

There is also a pressing need to address the drug-development pipeline. We have previously been for- 
tunate enough to have a continued supply of antibiotics to replace those to which bacteria have become 
resistant, but there has not been a new class of antibiotic developed since 1987 (http://bit.ly/1hibxAM).
The development of new antibiotics requires innovative funding solutions to promote research in this
field while removing the incentive for pharmaceutical companies to attempt to maximize sales volume 
of any newly developed product. Pharmaceutical companies are currently not incentivized to focus on 
antibiotics compared with other drugs for long-term conditions, such as statins and antihypertensives 
for cardiovascular disease, because antibiotics are given for shorter periods of time and therefore are far 
less profitable. Drugs for chronic conditions are taken every day for the rest of a patient's life, whereas 
antibiotics may only be taken for a week once every few years. New models need to be developed such 
as decoupling sales volume from monetary reward. Such models may be public-private partnerships, 
advanced purchase agreements, or something entirely new and are likely to require collaborations 
among academia, research funders, and not-for-profit organizations. Because of the international na- 
ture of regulations and in order to secure widespread availability of new products and foster global 
cooperation, any economic model developed would need to be standardized across nations. As such, 
the model will need to be acceptable to all nations. 

Antimicrobial resistance is of such concern to the UK that it is now on the risk registers for both the 
Department of Health and the Department for Environment, Food and Rural Affairs, alongside threats 
such as climate change. These risk registers assess the likelihood and potential impact of major risks 
that may directly affect the UK within the next 5 years and allow the relevant departments to plan their 
response to such events. Antimicrobial resistance, however, can only be successfully addressed through 
international cooperation. Given that resistant bacteria can be easily transmitted between countries, 
any mechanism that incentivizes the development of new antibiotics will only be effective if applied to 
the international pharmaceutical market as a whole. New basic and translational research efforts will be 
needed to identify new microbial targets and to develop drugs against them. We also need to galvanize 
international collaborations to prevent the global spread of antimicrobial resistance. This includes pre- 
vention and control, surveillance and monitoring, as well as stewardship and conservation. By working 
together as countries, we believe that sustainable solutions can be developed. Passage of WHO’s resolu- 
tion on antimicrobial resistance at the World Health Assembly next week would be an important first 
step in helping to solve this international crisis. 


- Simon J. Howard, Sarah Hopwood, Sally C. Davies


Modeling infectious disease dynamics in the complex landscape
of global health

[...] of infections play out on a wide range of interconnected temporal, organizational, and spatial
scales, which span hours to months, cells to ecosystems, and local to global spread. Moreover,
some pathogens are directly transmitted between individuals of a single species, whereas others
circulate among multiple hosts, need arthropod vectors, or can survive in environmental reservoirs.
Many factors, including increasing antimicrobial resistance, increased human connectivity and [...]

Thirty-five years ago, it was believed that
the health burden of infectious diseases
was close to becoming insignificant [...]
and vaccines brought about a steady decline
in overall mortality (1). In recent
decades, however, it has become clear that the
threat from serious infectious diseases will persist [...]
for emergence and spread in the coming decades
are likely to increase as a result of population
growth, increased urbanization and land
changes, greater travel, and increased livestock 
production to meet demands from the world’s 
expanding population (4-8). In our modern 
world of instant communication, the changing 
behavior of individuals in response to publicity 
about epidemics can have profound effects on 
the course of an outbreak (9, 10). Phylogenetic 
data shed light on an additional layer of com- 
plexity (11), as will increased understanding of 
the human genome in relation to susceptibility, 
infectiousness, and its duration. At the same 
time, the development of effective new vaccines 
remains a difficult challenge, especially for an- 
tigenically very variable pathogens (e.g., HIV 
or falciparum malaria) and for pathogens that 
stimulate immunity that is only partly protec- 
tive (e.g., Mycobacterium tuberculosis) or tem- 
porary (e.g., Vibrio cholerae). 

In the face of this complexity, computational 
tools (Box 1) are essential for synthesizing infor- 
mation to understand epidemiological patterns 
and for developing and weighing the evidence 
base for decision-making. Here, we review the 
contribution of these tools to our understanding 
of infectious disease dynamics for public health 
by using representative examples and by rang- 
ing into current developments. We argue that to 
improve decision-making for human health and 
for sustaining the health of our food systems, 
experts on infectious disease dynamics and ex- 
perts on prevention and control need to collab- 
orate on a global scale. To succeed, quantitative 
analysis needs to lie at the heart of public health 
policy formulation. 


Models and public health policy formulation 
The value of mathematical models to investigate 
public health policy questions was recognized at 
least 250 years ago when, in 1766, Daniel Ber- 
noulli published a mathematical analysis of the 
benefits of smallpox inoculation (then called va- 
riolation) (12). In the past 50 years, the study of 
infectious disease dynamics has grown into a rich 
interdisciplinary field. For example, decision- 
making for vaccination strategies increasingly 
depends on model analyses in which infection 
dynamics are combined with cost data (Box 2, 
Influenza: prevention and control). In recent 
decades, responses to major infectious disease 
outbreaks, including HIV, bovine spongiform 
encephalopathy (BSE), foot-and-mouth disease 
(FMD), SARS, and pandemic and avian influen- 
za, have shown both the need for and capabili- 
ties of models (Box 3, HIV: Test and treat strate- 
gy). Model-based analysis of such outbreaks also 
continually brings improvements in methodolo- 
gy and data, emerging from the comparison of 
model prediction with observed patterns.
For infectious agents important to public
health, a series of principles has emerged for 
modeling infection dynamics (Table 1 and Box 
4). The basic reproduction number R0, for example,
is a central concept characterizing the
average number of secondary cases generated 
by one primary case in a susceptible population. 
This concept highlights what must be measured 
to interpret observed disease patterns and to 
quantify the impact of selected control strate- 
gies (Fig. 1). 
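
As a minimal illustration of how the basic reproduction number translates into a control target, the sketch below computes the classic critical vaccination coverage, 1 - 1/R0, scaled by vaccine efficacy. It assumes a homogeneously mixing population and an all-or-nothing vaccine, which are textbook simplifications rather than claims made in the article, and the function names are hypothetical.

```python
def effective_reproduction_number(r0: float, susceptible_fraction: float) -> float:
    """Average secondary cases per case when only part of the population is susceptible."""
    return r0 * susceptible_fraction


def critical_coverage(r0: float, vaccine_efficacy: float = 1.0) -> float:
    """Vaccination coverage at which the effective reproduction number equals 1.

    Assumes homogeneous mixing and an all-or-nothing vaccine. A result
    above 1.0 means vaccination alone cannot reach the threshold.
    """
    if not 0.0 < vaccine_efficacy <= 1.0:
        raise ValueError("vaccine_efficacy must be in (0, 1]")
    if r0 <= 1.0:
        return 0.0  # infection cannot sustain itself even without vaccination
    return (1.0 - 1.0 / r0) / vaccine_efficacy
```

Under these assumptions, an infection with a basic reproduction number of 4 requires 75% coverage with a fully protective vaccine, and the required coverage rises as efficacy falls.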

Two fundamental properties of the world that 
shape infectious disease dynamics make com- 
putational tools key for understanding reality. 
The world is essentially a stochastic and high- 
ly nonlinear system. The nonlinearity derives 
not only from the complex interaction between 
factors involved in transmission, but also from 
the influence that the infection process has on 
the distribution of important characteristics at 
various temporal and spatial scales. This effect 
is seen in the age-related nature of infection and 
mortality in HIV changing the age distribution 
of the population, and in previous exposure to 
strains of influenza altering the distribution of 
influenza susceptibility. Such feedback mecha- 
nisms contribute to the nonlinearity of infection 
processes. Nonlinearity also leads to counter- 
intuitive phenomena (Fig. 2) and prevents sim- 
ple extrapolation of experience from one situa- 
tion to another, such as when deciding whether 
to implement a vaccination policy in different 
countries (Fig. 1). Mathematical tools, relating 
to data and processes on a large range of inter- 
acting scales, have become essential to explore, 
anticipate, understand, and predict the effects of 
feedbacks within such complex systems, includ- 
ing changes caused by intervention. 
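
The two properties named above, stochasticity and nonlinearity, can be seen in even the simplest simulation. The following sketch is a discrete-time stochastic SIR model (chain-binomial style) with daily steps; it is an illustrative toy under stated assumptions (closed, homogeneously mixing population; hypothetical function and parameter names), not a model taken from the article.

```python
import math
import random


def simulate_outbreak(n, r0, infectious_days, initial_infected=1, seed=None, max_days=1000):
    """Simulate one outbreak and return the number of people ever infected.

    Nonlinearity: the daily infection risk depends on the current number
    of infectious people. Stochasticity: each infection and recovery is a
    random draw, so identical parameters can give different outcomes.
    """
    rng = random.Random(seed)
    beta = r0 / infectious_days       # transmission rate per day
    gamma = 1.0 / infectious_days     # recovery rate per day
    s, i = n - initial_infected, initial_infected
    for _ in range(max_days):
        if i == 0:
            break  # outbreak has died out
        p_inf = 1.0 - math.exp(-beta * i / n)   # per-susceptible daily risk
        p_rec = 1.0 - math.exp(-gamma)          # per-case daily recovery chance
        new_inf = sum(rng.random() < p_inf for _ in range(s))
        new_rec = sum(rng.random() < p_rec for _ in range(i))
        s -= new_inf
        i += new_inf - new_rec
    return n - s
```

Running the function repeatedly with different seeds and the same parameters gives a distribution of outbreak sizes rather than a single answer, and some runs may end after only a handful of cases even when R0 is above 1.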


Current and future opportunities for models 
in public health 

Over the past decade, key public health ques- 
tions, ranging from emergence to elimination, 
have posed a range of challenges for modeling 
infectious disease dynamics, many of which rely 
on leveraging disparate data sources and inte- 
grating data from a range of scales from genom- 
ics to global circulation. Given commonalities 
in processes across pathogens, progress made 
in one area can lead to advances in another. 
Advances in the areas described above all build
on and inform each other, making this a dynamic
time for research in the discipline (13). A
few themes are chosen to illustrate current 
trends in model development and public health 
application. 


Real-time outbreak modeling: 

The Ebola 2014-2015 outbreak 

The 2014-2015 outbreak of Ebola in West Africa
serves to highlight both opportunities and
challenges in modeling for public health. In
the initial phase of this outbreak, real-time es- 
timates of the reproduction number or simple 
exponential extrapolation (14) allowed short- 
term predictions of epidemic growth that were 
used, for example, to plan for necessary bed 
capacity. Quantitative phylogenetic tools ap- 
plied to samples from initial victims provided 
important estimates of the origin of the out- 
break (15). Early mechanistic models that ex- 
plicitly took into account the roles played by 
different transmission routes or settings were 
informed by analysis of earlier outbreaks (16, 
17). When the failure to contain the epidemic 
with methods successful in previous outbreaks 
led to a scale-up of capacity driven by interna- 
tional aid, such models were used to assess the 
impact of, for example, reducing transmission 
at funerals (17) and whether the construction 
of novel types of treatment centers could end 
up doing more harm than good. Ensuring that 
the most effective combinations of interven- 
tions were implemented required close and 
fast interaction between modelers and policy- 
makers (18). Looking forward, models are now 
used to help clinical trial design and inform a 
debate on the optimal deployment of initially 
scarce Ebola vaccines, once such vaccines become 
available. 
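
The short-term extrapolation described above can be illustrated with a minimal sketch. It assumes case counts that grow roughly exponentially, fits a straight line to their logarithm, and projects forward. The conversion from growth rate to reproduction number uses R = 1 + r * Tg, which holds only under the added assumption of exponentially distributed generation times. The function names and the numbers in the usage note are illustrative and are not taken from the studies cited here.

```python
import math


def fit_growth_rate(counts):
    """Least-squares fit of log(counts) against the day index.

    Returns (growth rate per day, intercept on the log scale).
    All counts must be strictly positive.
    """
    n = len(counts)
    xs = list(range(n))
    ys = [math.log(c) for c in counts]
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_mean - slope * x_mean


def project(counts, days_ahead):
    """Extrapolate the fitted exponential days_ahead past the last observation."""
    rate, intercept = fit_growth_rate(counts)
    t = len(counts) - 1 + days_ahead
    return math.exp(intercept + rate * t)


def reproduction_number(rate, mean_generation_time):
    """R = 1 + r * Tg; assumes exponentially distributed generation times."""
    return 1.0 + rate * mean_generation_time
```

For example, `project(counts, 7)` gives a one-week-ahead projection from a list of daily counts. Such projections ignore interventions and behavior change, so they are only meaningful over short horizons.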

With the opportunities of real-time modeling 
for public health come specific challenges. The 
imperative to produce reliable and meaningful 
analysis for those treating infected people has to 
be balanced against the pressures and delays of 
scientific publication. In an ongoing outbreak, 
data can be patchy and reporting delayed, and 
different data sources are not always synthe- 
sized. When the Ebola outbreak expanded ex- 
plosively in the summer of 2014, data were often 
lacking on the effect on transmission dynamics 
of the various control measures that operated 
simultaneously in the hectic circumstances of 
the most severely hit areas. In any emerging 
epidemic, underreporting is a critical challenge 
for ongoing assessment; in this epidemic it 
has had an enormous impact on predictions of 
outbreak size and also of outbreak impact, for 
example, in terms of the case-fatality ratio (the 
proportion of cases that lead to death). Early in 
any outbreak, this estimate of severity can suffer 
from imprecise information on both the numer- 
ator (if not all deaths due to the infection are 
identified as such; for example, because health 
services are overwhelmed caring for the sick) 
and the denominator (if cases are not reported 
or, conversely, noncases get reported as cases if 
they are not laboratory-confirmed). This caused 
problems early in the H1N1 influenza outbreak 
first reported in Mexico in 2009, as well as 
in the current Ebola outbreak. Although the level 
of underreporting can be estimated from retrospective 
serological studies, it is usually not 
identifiable in real-time data. 
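
The numerator and denominator biases described above can be made concrete with a small sketch. The function and all numbers used with it are hypothetical. It assumes that a fixed fraction of true cases and of true deaths is reported, and that a given number of noncases is wrongly counted as cases.

```python
def observed_cfr(true_cases, true_deaths, case_reporting, death_reporting,
                 false_cases=0):
    """Case-fatality ratio as it would appear in surveillance data.

    case_reporting and death_reporting are the fractions (0 to 1) of true
    cases and true deaths that are reported. false_cases is the number of
    noncases counted as cases, which inflates the denominator.
    """
    reported_cases = true_cases * case_reporting + false_cases
    reported_deaths = true_deaths * death_reporting
    return reported_deaths / reported_cases
```

With a hypothetical true ratio of 0.5, missing half of the cases while counting every death doubles the observed ratio, whereas missing deaths or counting noncases lowers it. The direction of the bias therefore cannot be known without information on reporting.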

These limitations make it almost impossible 
to make reliable long-term predictions. Thus, 
modeling results are often based on scenarios in 
which a pathogen spreads unaltered by behav- 
ioral changes or the public health response. This 
rarely reflects reality, especially in such a dev- 
astating outbreak as Ebola, where the situation 
constantly changes owing to growing awareness 
in the community, as well as national and inter- 
national intervention. Careful communication 
of findings is key, and data and methods of anal- 
ysis (including code) must be made freely avail- 
able to the wider research community. Only in 
this way can reproducibility of analyses and an 
open exchange of methods and results be ensured 
for maximal transparency and benefit to 
public health. 


Emergence of novel human pathogens 

There is an ever-present hazard that novel hu- 
man pathogens emerge from livestock and wild 
mammal and bird reservoirs. Research on po- 
tential emerging zoonoses draws on concepts 
from across the spectrum of infectious disease 
dynamics, disease ecology, microbiology, and 
phylogenetic analysis. Particular challenges include 
estimating human-to-human transmissibility 
against a backdrop of ongoing zoonotic 
spillover, detecting anomalous outbreaks, and 
assessing the risk that more dangerous strains 
may arise through pathogen evolution. 

The recently identified gap in methodology 
for zoonoses with weak human-to-human transmission 
(6) is being filled with new approaches 
for estimating R0 and other transmission-related 
quantities from subcritical outbreak 
data (19-21). These studies address key public 
health concerns, but rely on strong assump- 
tions regarding the quality and completeness 
of case observations. Better information on 
surveillance program efficacy could be gained 
through serological surveys (where blood 
and saliva samples reveal evidence of past and 
present infections) or sociological study, and 
modeling studies can help to design and char- 
acterize efficient surveillance programs (22). 
Given the predominance of zoonotic patho- 
gens among emerging infections, models for 
transmission dynamics and evolution in multi- 
species ecosystems and food webs (consisting 
of host species and nonhost species interacting 
ecologically and epidemiologically) are a crucial 
area for future development (6, 23). The greatest 
challenge, and the greatest prize, in modeling 
emerging zoonoses is to assess which diseases 
pose the most risk to humans and how these 
might change over time and in different locali- 
ties (24). Such tasks, which will join molecular 
studies to experimental infections to epidemio- 
logical and ecological surveys, will drive empir- 
ical and theoretical efforts for decades to come. 
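
One simple way to see how a reproduction number can be estimated from subcritical outbreak data is the chain-size argument sketched below. It is not the method of the cited studies. It assumes a branching process with R below 1, in which every transmission chain starts from a single spillover case and is observed completely, so the expected chain size is 1 / (1 - R). These are the kind of strong assumptions about completeness of case observations noted above.

```python
def estimate_r_from_chain_sizes(chain_sizes):
    """Estimate R for a subcritical pathogen from observed chain sizes.

    Each entry is the total number of cases in one transmission chain,
    including its primary (spillover) case. Since the mean chain size is
    1 / (1 - R), the estimate is R = 1 - (number of chains / number of cases).
    """
    n_chains = len(chain_sizes)
    n_cases = sum(chain_sizes)
    return 1.0 - n_chains / n_cases
```

For instance, five hypothetical chains of sizes 1, 1, 1, 2, and 5 contain ten cases, five of which are secondary, giving an estimate of 0.5. Missed single-case chains would bias this estimate upward.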

The rising availability of pathogen genome 
sequence data, coupled with new computation- 
al methods, presents opportunities to identify 
with precision “who infects whom” and the 
networks of infection between humans and 
reservoirs (25). Full realization of this poten- 
tial, though, will require denser and more sys- 
tematic whole-genome sampling of pathogens 
coupled with associated epidemiological data, 
as well as baseline information on genetic di- 
versity and evolutionary rates, especially in an- 
imal hosts (26). 


Pathogen evolution and phylodynamics 

As pathogen genetic data become increasingly 
available, modelers are finding ways to synthesize 
these new data streams with more traditional epi- 
demiological information in phylodynamic tools 
(27, 28). However, current frameworks employ 
compartmental epidemiological models, which 


Box 1. Quantitative tools in infectious disease dynamics. 


Here, we use the words “computational tools” loosely. In infectious disease dynamics, there is a 
broad range of relevant quantitative tools, and we refer to the entire collection. It comprises 
statistical methods for inference directly from data, including methods to analyze sequencing and 
other genetic data. This leads to estimates of important epidemiological information such as 
length of latency, incubation and infectious periods and their statistical distributions, inferred 
transmission chains and trees early in outbreaks, the risks related to various transmission routes, 
or estimates of rates of evolution. Mathematical models in the strict sense refer to mathematical 
descriptions of processes thought to be associated with the dynamics of infection—for example, in a 
population or within an individual. Such models take many forms, depending on the level of bi- 
ological knowledge of processes involved and data available, and depending on the purpose. 

Several classes of model are used, spanning the spectrum of information available. At one end 
of the range are detailed individual-based simulation models, where large numbers of distinct 
individual entities (with their own characteristic traits such as age, spatial location, sex, immune 
status, risk profile, or behavior pattern) are described in interaction with each other, possibly in a 
contact network, and with the infectious agent. At the other end are compartmental models where 
no individuals are recognized, but only states for individuals (for example: susceptible, infectious, 
immune) aggregated into compartments where everyone has the same average characteristics 
and where interaction is typically uniform (everybody interacts with everybody else). Such models 
do not describe the disease history of single individuals, but rather the time evolution of aggre- 
gated variables, such as the number of individuals that are currently susceptible. 
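
A minimal sketch of such a compartmental model is the deterministic SIR model below, integrated with a simple forward Euler step. The parameter names (`beta` for the transmission rate, `gamma` for the recovery rate) and the values used with it are illustrative assumptions, and the fixed step size is chosen for brevity rather than accuracy.

```python
def simulate_sir(beta, gamma, s0, i0, rec0, days, dt=0.1):
    """Integrate the SIR equations with forward Euler steps.

    dS/dt = -beta * S * I / N
    dI/dt =  beta * S * I / N - gamma * I
    dR/dt =  gamma * I
    Returns a list of (time, S, I, R) tuples.
    """
    s, i, r = float(s0), float(i0), float(rec0)
    n = s + i + r
    trajectory = [(0.0, s, i, r)]
    steps = int(round(days / dt))
    for k in range(1, steps + 1):
        new_infections = beta * s * i / n * dt
        new_recoveries = gamma * i * dt
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
        trajectory.append((k * dt, s, i, r))
    return trajectory
```

Only the aggregated numbers in each compartment are tracked; no individual history is represented, which is the defining simplification of this model class.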

Mathematical models can have both mechanistic parts in their description, based on assump- 
tions about biological mechanisms involved, and more phenomenological parts, where there is a 
statistical or presumed relation between variables, without clear assumptions from which this re- 
lation can be derived. An example of the former is the assumption of mass action to describe 
interaction between individual hosts; an example of the latter is an empirical relation between the 
length of an infectious period in a mosquito and environmental temperature. 

For infectious disease dynamics, our world is clearly stochastic, in that chance events play a role in 
many of the processes involved. Certainly at lower levels of biological aggregation, chance dominates— 
for example, in infection of individual cells or in contacts individual hosts make. At higher aggregation 
levels, many cells or individuals interact, and chance effects may average out to allow deterministic 
descriptions. There are purely stochastic models, purely deterministic models, and models that are 
mixed. It is important to point out that, even though the world is stochastic, stochastic descrip- 
tions are not by definition better than deterministic descriptions. Both are still models of reality, 
and the fact that chance plays a role may have a far less significant influence on model outcome 
and prediction than choices made in the relations between ingredients and variables. 
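
To illustrate the contrast with a deterministic description, the sketch below simulates a stochastic SIR model event by event (a Gillespie-type algorithm). It assumes the same mass-action structure as a standard SIR model, with exponentially distributed waiting times; the parameter names and values are illustrative.

```python
import random


def gillespie_sir(beta, gamma, s0, i0, rng):
    """Simulate one stochastic SIR outbreak until no infectious hosts remain.

    Returns (duration, final_size), where final_size counts everyone who
    was ever infected, including the initial cases.
    """
    s, i, r = s0, i0, 0
    n = s0 + i0
    t = 0.0
    while i > 0:
        rate_infection = beta * s * i / n
        rate_recovery = gamma * i
        total_rate = rate_infection + rate_recovery
        t += rng.expovariate(total_rate)  # waiting time to the next event
        if rng.random() < rate_infection / total_rate:
            s -= 1
            i += 1
        else:
            i -= 1
            r += 1
    return t, r
```

Repeated runs with identical parameters, for example `gillespie_sir(0.4, 0.2, 999, 1, random.Random(seed))` for different seeds, give either early fade-out or a large outbreak, whereas the deterministic model always yields a single trajectory.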

Areas of rapid growth are statistical and numerical methods and tools to estimate model pa- 
rameters from, often scarce, mismatched or incomplete data, and to contrast model output with 
real-world observations. 

do not make efficient use of individual-level 
epidemiological data. Although sampling theory is 
well developed for standard surveillance data, the 
relationship between a set of pathogen sequenc- 
es and the phylogeny inferred from a population 
sample is more complex (11). Many-to-one map- 
ping possibilities between, on the one hand, com- 
binations of epidemiological, immunological, 
and evolutionary processes shaping sequences 
and, on the other hand, the inferred phylogeny, 
demand the integration of diverse data sources 
and an increased focus on systematic sampling. 
Phylodynamic studies to date have largely fo- 
cused on fast-evolving RNA viruses, driven by 
the large amount of data generated for clinical 
[e.g., hepatitis C virus (HCV) or HIV] or surveillance 
(e.g., influenza) purposes (11). Replicating 
these efforts on an expanded array of pathogens, 
including DNA viruses, bacteria, fungi, protozoa 
(e.g., malaria), and helminths, is a promising 
avenue for future research (29). It is of particular 
importance in the context of the evolution and 
spread of drug-resistant variants and vaccine 
escape mutants. However, genome-wide pathogen 
data also present challenges, in particular in 
relation to accommodating recombination, 
reassortment, and mobile genetic elements. Analysis of 
bacterial genomes usually considers only those 
genes that are shared across taxa, but there are 
good reasons to believe that noncore genes play 
an important role in bacterial evolution, including 
the evolution of antibiotic resistance (30). 

Although sequence data are extremely valuable, 
to link these data fully to disease dynamics, 
it will be important to determine how sequence 
changes affect functions related to pathogen 
fitness, such as replication rate, transmissibility, and 
immune recognition. Molecular epidemiological 
studies often treat pathogen genetic variation as 
simply reflecting the underlying transmission 
process, whereas in reality such variation may 
play an important role in determining transmission 
dynamics, as exemplified by escape from 
herd immunity by influenza A virus (31). 

“Deep” sequencing of pathogens within 
individual hosts generates information on 
within-host diversity, resulting from evolution 
within the host (often in response to drug 
treatment), or multiple infections. To tackle 
within-host diversity, models that embed pathogen 
evolution within a transmission tree are needed. 
Such models, which cross the within- and 
between-host scales, are only just becoming 
analytically and computationally feasible despite 
being proposed several years ago (32). Similarly, 
although progress has been made in scaling 
inference from genes to genomes (33), scaling 
inference to large numbers of sequences is 
lagging far behind. 

Fig. 1. Modeling for public health exemplified by rubella. (A to F) Policy questions are formulated; available data 
are brought to bear on the question. In this example, the incidence of rubella is shown following the introduction of 
vaccination in individuals under 15 or 15+ years of age in Costa Rica (127). Application of a nonlinear age-structured 
SIR model (see Box 4) to these circumstances led to the collection of key missing data. In the bottom right-hand plot, 
each square depicts a combination of birth rate and infant vaccine coverage reflecting different countries (e.g., 
Somalia depicted by a diamond and Nepal by a circle), colored by expected effect on congenital rubella syndrome 
(CRS) in newborns, related to local R0 (128). This translates into confidence that routine vaccination is likely to 
reduce the public health burden caused by CRS in Nepal (green), but not in Somalia (red). 


Multiple infections 

Infectious disease epidemiology 
evolved by focusing on interactions 
between a single host species and a 
single infectious agent. It is becoming 
increasingly clear that multiple 
agents simultaneously infecting the 
same host populations and individuals 
appreciably add to the public 
health burden and complicate prevention 
and control. Coinfections in 
relation to HIV (for example, tuberculosis 
and HCV) or coinfection of 
different strains of influenza A virus 
raise important public health and 
evolutionary issues. Multiple agents 
infecting the same host individual 
have been shown to influence each 
other by increasing or decreasing 
susceptibility and/or infectivity of 
that individual, thereby influencing 
the population dynamics of these 
agents in ways that we have yet to ex- 
plore and understand (34, 35). 
Multiple infections of the same 
individual with closely related pathogens 
occur when infection elicits no immunity, 
or only a partial immune response. Macroparasites, 
including many of the important human 
helminth infections, are good examples of patho- 
gens that evade human immune responses and 
cause repeated infection of the same host (36). 
Biological mechanisms giving rise to such mul- 
tiple infections include sequential reinfections 
caused by antigenic drift in influenza, antigenic 
variation in respiratory syncytial virus (RSV), 
and waning (slow loss of) immunity in pertussis, 
while lack of cross-protection in many colonizing 
microparasites, for example, pneumococcus 
and human papilloma virus (HPV), allows 
for multiple concurrent infections. Although the 
existence of reinfections is a clinical fact, 
population-level data are scarce as reinfections are often 
subclinical and individual-based longitudinal 
infection histories are often only anecdotal. 
Results from new analytical approaches relating 
to deep sequencing and neutralization tests cov- 
ering multiple antigens are being utilized (37). 
The immunodynamics of influenza have 
clear policy implications for the identification of 
high-risk groups in connection with pandemic 
planning (38), while the dynamics of waning 
immunity are key to the current concerns about 
immunization level for pertussis (39). Multivalent 
vaccines covering only a targeted subset 
from the circulating strains of pneumococcus 
and HPV pose important new applied problems 
(40). The spread of recombinant viruses implies 
the existence of multiple infections. One example 
is the Sydney 2012 strain of norovirus, but 
how this can occur in such an acute infection 
remains to be understood, as the time window 
for multiple exposures is limited, unless 
subclinical or environmental reservoirs of infection 
are important. Mathematical models could help 
to explore how, for example, such subpopulations 
may contribute to the dynamics of multiple 
infections. 

Fig. 2. Examples of counterintuitive effects of nonlinear infection dynamics. (A) Nonlinear interaction 
between prevalence of a helminth infection and infection pressure (as measured by the mean intensity of existing 
infections) means that control measures must have a disproportionately large impact on intensity before 
prevalence is reduced. This effect is predicted by a mathematical model (solid line) and corroborated by field 
data (crosses) (129). (B) Nonlinear relation between total number of cases of congenital rubella syndrome (CRS) 
and rubella vaccine coverage, showing that suboptimal levels of vaccine coverage cause worse health outcomes 
than no vaccination [adapted from (130)]. The line shows model predictions; similar effects have been documented 
for real rubella control situations (131). (C and D) Modeling results of rebound of gonorrhea transmission with 
different treatment strategies without (C) and with (D) antimicrobial resistance developing [adapted from (132)]. 
In the presence of resistance, focusing treatment on the high-risk core group leads to an increase in prevalence 
approaching that of untreated baseline prevalence, after an initially strong decline for more than a decade. 
(E) Box plot from field data of a nonlinear relation between R0 for dengue transmission and average dengue 
hemorrhagic fever (DHF) incidence across Thailand, showing that control measures that reduce R0 may paradoxically 
increase cases of DHF [adapted from (133)]. 


Behavior of hosts 

Human behavior is a fundamental determinant 
of infectious disease dynamics, whether by af- 
fecting how people come in contact with each 
other, vaccination coverage, reporting biases, 
or adherence to treatment. Traditional epidem- 
ic models have tended to ignore heterogeneity 
in contact behavior [although early HIV mod- 
els addressed heterogeneity in sexual behavior 
by necessity (41)]. Increasing sophistication 
of contact network models (42), together with 
data on epidemiological contacts, creates op- 
portunities for understanding and controlling 
transmission at a fundamental level (43) and 
opens up the possibility of independent study 
of relevant social factors (10). Recent years 
have seen exciting developments in the mea- 
surement of contact patterns and “who might 
infect whom” through advances in individual 
electronic identification technology. This is a 
promising avenue for linking pathogen genetic 
data and human behavior. 

Contact patterns are not static and can shift 
during outbreaks as individuals change their be- 
havior in response to perceived risk and public 
health interventions (44). Modeling has illumi- 
nated this process—for example, by the incorpo- 
ration of peer influence on vaccination behavior 
into models of infectious disease dynamics (45, 
46). Analysis of data from online social net- 
works has also created promising opportunities 
to validate such approaches with empirical ob- 
servations (47, 48). 

Movement and travel are tightly linked 
to the spread of infection and have been ex- 
plored through models to highlight commut- 
ing and agricultural migration driving local 
disease transmission (49) and global disease 
patterns through air travel (50). These proc- 
esses are now being investigated to gain in- 
sights into the more complex case of vector- 
borne diseases, such as malaria and dengue, 
where both host and vector movement can 
interact to drive local (51) and large-scale 
dynamics (52). 


Elimination and eradication 

Modeling has long provided support for elimi- 
nation efforts: Vector control (53), critical com- 
munity size (54), herd immunity, and critical 
vaccination threshold (55, 56) were all powerful 
insights from models framed in relatively simple 
and homogeneous terms. Subtleties and com- 
plexities in many current eradication programs, 
as well as the availability of novel data sources, 
have called for a range of extensions in the the- 
ory. As we approach elimination targets, disease 
dynamics have changed in ways that were largely 
predicted by models, but also in unanticipated 
ways as a result of ignorance about key epidemi- 
ological processes (3). 
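
The critical vaccination threshold mentioned above follows from a short calculation in the simplest setting. The sketch assumes homogeneous mixing and a vaccine that fully protects a fraction p of the population; neither assumption holds exactly in practice, and the numerical value in the comment is purely illustrative.

```latex
% Assumptions (not from the source): homogeneous mixing, a vaccine that
% fully protects a fraction p of the population, basic reproduction number R_0.
\begin{align}
  R_v &= (1 - p)\, R_0, \\
  R_v < 1 \;&\Longleftrightarrow\; p > p_c = 1 - \frac{1}{R_0}.
\end{align}
% Illustrative value only: R_0 = 4 gives p_c = 0.75.
```

Heterogeneity in contact patterns, imperfect vaccines, and waning immunity all change this threshold, which is one reason the simple theory has needed the extensions described here.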


Incentives for control efforts also change, 
both at the individual level [passive or active re- 
fusal to participate can develop (57)] and at the 
country level (58). This reinforces the call for 
development of models of human behavior and 
its interaction with infectious disease dynam- 
ics (9) potentially drawing on new data sources 
from social media (59, 60), as well as for models 
that can capture national and nongovernmental 
motivations, interactions, and competition, economic 
or otherwise. Long-term control puts 
pathogens under strong selection for resistance, 
calling for evolution-proof control methods (61) 
and novel vaccine technologies and their opti- 
mized delivery (62). 


Box 2. Influenza: prevention and control. 


Human influenza—pandemic and seasonal—remains a major issue in public health owing to 
the continued emergence of novel genetic strains, and one where models have successfully 
addressed questions from basic biology to epidemiology and health policy. In recent years, 
modeling and other quantitative analysis has been used to study at least three major issues: 
pandemic preparedness and mitigation strategies (84-89), rethinking vaccination strategies 
for seasonal influenza (70), and improved methods in phylodynamics and influenza strain 
evolution (11). Recent models of influenza fitness have also been developed to predict viral 
evolution from one year to the next, providing a principled and more precise method for the 
vaccine selection required every year (90). 

For seasonal influenza, models have played a key role in providing the scientific evidence base 
for vaccination policy, making use of the information in multiple, often unavoidably biased, data 
sources such as syndromic time series, vaccine coverage and efficacy, economic costs, and con- 
tact patterns in the population. For example, a combined epidemic and economic model was fitted 
to fine-grained data from many sources to describe the dynamics of influenza in the United 
Kingdom, and the influence of previous vaccination programs (70). With confidence in the model's 
predictions based on its ability to capture past patterns, it was used to look at alternative vac- 
cination strategies and led to a new national policy to vaccinate school-age children (91). Targeting 
those individuals most likely to spread the virus, rather than only those most likely to suffer the 
largest morbidity, is a marked departure from established practice in the UK and is currently under 
consideration elsewhere (92). 


Box 3. HIV: test and treat. 


Mathematical modeling has played a central role in our understanding of the HIV epidemic, and 
in informing policy from the outset of our recognition of the pandemic (93). Some of the many 
insights include a model-based analysis of viral load data from inhibition experiments, which 
revealed the rapid and ongoing turnover of the within-host viral population (94), and the use of 
phylogenetic models to show that the HIV pandemic did not emerge in the 1980s, but had its roots 
in the early 20th century (95). 

A key contribution of mathematical modeling has been to identify when viral transmission 
occurs over the course of infection, which determines the potential to halt spread by various 
measures. Models have shown that transmission of HIV depends on the epidemic phase and 
the sexual behavior of the population, and a large proportion of transmissions may occur late in 
infection (96). Model-based inference in the Netherlands also suggested that the effective 
reproduction number (Box 4) had fallen below 1 due to a combination of low-risk behavior and a very 
effective diagnosis and treatment program (97). The debate was transformed in the mid-2000s, when 
eradication of HIV through a “test and treat” strategy was hypothesized (98, 99). Subsequent trial 
results showing that antiretroviral treatment (ART) of HIV-positive individuals could practically 
eliminate transmission within sexual partnerships when the index case is treated (100) have 
further supported the role of treatment as prevention. Although these findings have not dispelled 
concerns about transmission early in infection (93), or about extra-couple transmission (101), it is 
suggested that high population coverage of ART may have reduced the incidence of HIV infection 
in rural KwaZulu-Natal, South Africa (102). 

These findings, combined with the prospect of cheaper, more effective drugs and delivery 
structures, underpin UNAIDS’ goal of “zero new infections” for HIV and the initiation of a 
multimillion dollar cluster-randomized trial (103), which will have its outcome assessed against model 
million dollar cluster-randomized trial (103), which will have its outcome assessed against model 
predictions. In the meantime, the scientific discussion of the effectiveness of ART in preventing 
transmission continues, sparked by studies that fail to show a decline in incidence after increased 
treatment (104). Such debates are essential to elucidate areas for improvement of the models 
used and data needs for such improvement, and to highlight methodological limitations (105). 
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Finally, since the era of smallpox eradica- 
tion, patterns of global disease circulation have 
changed radically. Human mobility and migra- 
tion are increasing global connectivity, strength- 
ening the need for cooperation and interna- 
tional synchronization of efforts (as illustrated 
by polio). Techniques for analysis of novel data 
sources are again key here; e.g., mobile phone 


Table 1. Principles for modeling infection dynamics. As different infections 
have become the focus of public health attention, the modeling community 
has responded by developing improved concepts and methods. The table 
concentrates on the period since 1950. The first column lists the classes of 
infection, and the second column lists factors whose importance to infection 


Motivating studies 


Malaria 
[1910s, 1950s onwards, 
(53, 106, 107)] 


Important factors 


records provide unique opportunities to under- 
stand disease source-sink dynamics (52). 


Computational statistics, model fitting, 

and big data 

By definition and design, models are not reality. 
The properties of stochasticity and nonlinear- 
ity strongly influence the accuracy of absolute 


dynamics became partic 
and fourth columns high 


Concepts 


Transmission via insect vectors; Threshold for control, basic 
nonlinear dependence of 
transmission on mosquito 


reproduction number. 


biting rate; influence of 
environmental and climatic 


variables. 
Childhood infectious 
diseases, e.g., measles. 
[1950s onwards, 
(54, 108)] 
Macroparasites. 
[1970s onwards, (108)] 


infections. 
Sexually transmitted 
infections, e.g., HIV. 
[1980s onwards, 
(108, 109)] 


Veterinary outbreaks, 
e.g. BSE and FMD. 
[1990s onwards, 
(110, 111)] 


Novel emerging 
infections, e.g., SARS, 
Nipah virus, MERS. 
[2000 onwards, 

(6, 19, 112-115)] 


Influenza, including 
avian influenza. 


Immunizing infections; spatial 
and temporal heterogeneity; 
demography; age structure; 
household structure. 

Clumped infections, multistrain Consequences of 
and multispecies infections, 
cross immunity, concurrent 


Fixed spatial locations with 
changing contact networks. 


Behavior change; global 
interconnectedness and 
international cooperation 
in control; responses in 
absence of biomedical 
measures; animal reservoirs. 

Distribution of prior immunity; Pandemics; spillover 
within-population and species 

[Present, (27, 28, 116-118)] strain differences, virus 


Critical community size and 
herd immunity; periodic 


efficacy. 


overdispersed distribution 
of parasite load (Fig. 1) 


High/low risk groups; nonrandom Incubation and infectious 
contact structure; partnerships; 
within-host strain diversity and 
evolution; time scale. 


period distribution; core 
group; next-generation 
matrix and operator; 
partnership dynamics. 
Local versus long-range 
transmission; spatial 
intervention (ring 
vaccination/culling); 
conflict of priorities at 
different scales. 
Zoonotic spillover; stuttering 


case; superspreaders; 
unobserved dynamics in 
an animal reservoir; 
supershedding 


between wild birds and 


evolution and interaction; 
role of wildlife and farm 


animals. 

Vector-borne diseases, 

e.g., dengue, malaria. 
[Present, (119-122)] 


The influence of climate and 
environment on vector and 
pathogen development; 
animal reservoirs; interaction 


Dilution effect and role of 
biodiversity in infectious 
disease dynamics; 
reemerging infections. 


between strains within- 
host and between-host. 


Bacterial infections, e.g., 
pneumococcal disease, 
MRSA, and tuberculosis. 
[Present, (39, 123-126)] 
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Antibiotic/drug resistance; 
adaptive dynamics. 


Vaccine effectiveness; 
interacting natural 
immunity and vaccine 
boosting. 


outbreaks; fade out; vaccine 


chains; importance of index 


farmed birds; phylodynamics. 
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predictions over long time horizons. Even if the 
mechanisms involved are broadly understood 
and relevant data are available, predicting the 
exact future course of an outbreak is impossi- 
ble owing to changes in conditions in response 
to the outbreak itself, and because of the many 
chance effects in play. These stochastic effects 
dominate developments in situations with rel- 


ularly clear in relation to those infections; the third 
ight concepts and methods that were developed in 
response. For each row, only a few typical references are given. Many factors, 
concepts, and methods are relevant, in current use, and in continual develop- 
ment for much larger classes of infectious agents. 


Methods 


Models with two host species (host- 
vector models); using models to 
support and guide field campaigns; 
relating models to field data. 


SIR models; age-structured models; 
models with periodic forcing; spatial 
and stochastic models; metapopulation 
models; time-series models. 

Stochastic models, approximations 
including hybrid models and 
moment closure. 


Statistical methods (e.g., back 
calculation); models with 
(dis)assortative mixing; pair-formation 
models; within-host dynamic models. 


Individual-based models and spatial 
simulations (FMD); data-driven 
real-time modeling; inference of 
transmission trees. 


Contact tracing; modeling international 
spread and control; quarantine 

and case isolation; individual 
heterogeneity in infectiousness, 
incubation, and latency period.


Interaction between immunological 
and epidemiological dynamics; 
integrating phylogenetic and 
epidemic methods and models. 


Evolutionary impact of vaccines/other 
interventions; synthesis of data 
from ecology and epidemiology; 
elimination modeling; statistical 
modeling of environmental vector 
suitability. 

Modeling interacting and emerging 
strains; stochastic models in small 
populations. 
atively few infected individuals that occur at
emergence, approaching the threshold for sus- 
tained host-host spread, or approaching elimi- 
nation and eradication. This makes it virtually 
impossible to predict which infectious disease 
agent is going to emerge and evolve next and 
where, or to predict when and where the next 
or last case in an outbreak will occur. There is, 
typically in complex systems, a fundamental 
horizon beyond which accurate prediction is 
impossible. The field has yet to explore where 
that horizon is and whether computational 
tools and additional data (and if so which data) 
can stretch predictions to this limit. In contrast, 
“what-if” scenarios for public health interven- 
tion can provide qualitative (and increasingly 
semiquantitative) insight into their population 
consequences. 

With growing applications in public health, 
there is an increasing demand to validate mod- 
els by making model predictions consistent 
with observed data. The development of ever- 
more-powerful computers is accompanied by 
new techniques utilizing this power, notably for 
statistically rigorous parameter estimation and 
model comparison. Techniques such as Markov 
chain Monte Carlo (MCMC) have become firmly 
established tools for parameter estimation from 
data in infectious disease models [e.g., (63)], 


and Monte Carlo based methods will play a piv- 
otal role in addressing the challenges that lie in 
reconciling predictions and observations (64, 
65). Other techniques, such as so-called parti- 
cle filters, approximate Bayesian computation, 
emulation, and their combinations with MCMC 
[e.g., (66)], are rapidly developing and allow sto- 
chastic models that explicitly account for incom- 
plete observations to be matched to time series 
of cases, giving insights into scenarios as diverse 
as cholera in Bangladesh (67) and influenza (68, 
69). The need to integrate multiple data sourc- 
es (70, 71), as well as to include uncertainty in 
model parameters and/or structure, has driven 
the use of Bayesian approaches. 
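As a minimal sketch of the MCMC idea described above, the following Python example estimates an exponential growth rate from early case counts with a random-walk Metropolis sampler. The case counts, the assumed expected count of 5 on day 0, the flat prior on (0, 1), and all function names are illustrative assumptions rather than anything taken from the cited studies; applied analyses typically rely on richer stochastic models and more sophisticated samplers.

```python
import math
import random

# Hypothetical daily case counts from the early phase of an outbreak
# (illustrative numbers, not data from the article).
cases = [4, 7, 8, 8, 12, 15, 15, 22, 24, 31]
INITIAL_CASES = 5.0  # assumed known expected count on day 0


def log_likelihood(growth_rate):
    """Poisson log-likelihood (up to a constant) under exponential growth."""
    total = 0.0
    for day, observed in enumerate(cases):
        expected = INITIAL_CASES * math.exp(growth_rate * day)
        total += observed * math.log(expected) - expected
    return total


def metropolis(n_steps=20000, step_sd=0.02, start=0.1, seed=1):
    """Random-walk Metropolis with a flat prior on growth_rate in (0, 1)."""
    rng = random.Random(seed)
    current = start
    current_ll = log_likelihood(current)
    samples = []
    for _ in range(n_steps):
        proposal = current + rng.gauss(0.0, step_sd)
        if 0.0 < proposal < 1.0:  # proposals outside the prior support are rejected
            proposal_ll = log_likelihood(proposal)
            # Accept with probability min(1, likelihood ratio).
            accept_probability = math.exp(min(0.0, proposal_ll - current_ll))
            if rng.random() < accept_probability:
                current, current_ll = proposal, proposal_ll
        samples.append(current)
    return samples


samples = metropolis()
posterior = sorted(samples[2000:])  # discard the first 2000 steps as burn-in
posterior_mean = sum(posterior) / len(posterior)
lower = posterior[int(0.025 * len(posterior))]
upper = posterior[int(0.975 * len(posterior))]
print(f"growth rate per day: {posterior_mean:.3f} "
      f"(95% credible interval {lower:.3f} to {upper:.3f})")
```

The sampler returns a distribution for the parameter rather than a single value, so the uncertainty in the estimate can be carried forward into any projection built on it.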

Although the rapid expansion of infectious 
disease models and their application over the 
past decade has coincided with an increase in 
open access data sets available from a variety 
of sources, progress in data capture needs to be 
accelerated. Although some of these techno- 
logically advanced data streams have been in- 
corporated into models—for example, to track 
the incidence of influenza in the United States 
(72), to elucidate the spatial dynamics of measles 
and malaria in Africa (53, 73), and to chart the 
spread of dengue globally (74)—much more re- 
mains to be done to leverage data collected from 
different sources (e.g., demographic, genetic, 


Box 4. Some fundamental terms and concepts.

* susceptible: individual who is at risk of becoming infected if exposed to an infectious agent.

* basic reproduction number, $R_0$: average number of infections caused by a typical infected
individual in a population consisting only of susceptibles; if $R_0 > 1$, the infectious agent can
start to spread.

* effective reproduction number, $R_e$: average number of infections caused by a typical
infected individual when only part of the population is susceptible; as long as $R_e > 1$, the agent
can continue to spread.

* herd immunity: state of the population where the fraction protected is just sufficient to
prevent outbreaks ($R_e < 1$).

* critical elimination threshold, $p_c$: proportion of the susceptible population that needs to
be successfully protected (for example, by vaccination) to achieve herd immunity; $p_c = 1 - 1/R_0$
is a rule of thumb from models when hosts are assumed to mix randomly.

* force of infection: per capita rate at which susceptible individuals acquire infection.

* final size: fraction of the initial susceptible population that eventually becomes infected
during an outbreak.

* prevalence: proportion of the population with infection or disease at a given time point.

* superspreader/supershedder: infected individual that produces substantially more new
cases than the average because of greater infectiousness, longer duration of infectiousness,
many more transmission opportunities and contacts, or combinations of these. Even when the
average $R_0$ is relatively small, these individuals have large effects on outbreaks.

* metapopulation: collection of populations, separated in space, but connected through
movement of individuals.

* critical community size: minimum number of individuals in a population that allow an
infectious agent to persist without importation of cases.

* case fatality ratio: proportion of symptomatic infections that result in death.

* SIR model: most basic model metaphor for immunizing infections where each living individual
is assumed to be in one of three epidemiological states at any given time: susceptible, infected and
infectious, and recovered and immune. The model specifies the rates at which individuals change
their state. Individuals progress from S to I when infected, then from I to R upon recovery. Many
variants exist (for example, recognizing different classes of S, I, and R individuals, depending on
individual traits such as age).
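Several of the terms in Box 4 can be illustrated with a short numerical sketch. The Python example below integrates a deterministic SIR model with a simple Euler scheme and evaluates the random-mixing rule of thumb for the critical elimination threshold. The parameter values (a basic reproduction number of 3, a recovery rate of 0.2 per day, an initial infected fraction of 0.0001) and the function names are assumptions chosen for illustration only.

```python
def herd_immunity_threshold(r0):
    """Critical proportion p_c = 1 - 1/R0 under random mixing (0 if R0 <= 1)."""
    return max(0.0, 1.0 - 1.0 / r0)


def simulate_sir(r0, recovery_rate=0.2, initial_infected=1e-4,
                 immune_fraction=0.0, days=1000, dt=0.05):
    """Euler integration of the SIR model in population fractions.

    Returns (final_size, peak_prevalence). The final size is expressed as a
    fraction of the initially susceptible population, as defined in Box 4.
    """
    transmission_rate = r0 * recovery_rate
    s0 = 1.0 - immune_fraction - initial_infected
    s, i = s0, initial_infected
    peak = i
    for _ in range(round(days / dt)):
        force_of_infection = transmission_rate * i  # per capita infection rate
        new_infections = force_of_infection * s * dt
        recoveries = recovery_rate * i * dt
        s -= new_infections
        i += new_infections - recoveries
        peak = max(peak, i)
    return (s0 - s) / s0, peak


R0 = 3.0
threshold = herd_immunity_threshold(R0)
final_size, peak = simulate_sir(R0)
final_size_protected, peak_protected = simulate_sir(R0, immune_fraction=0.7)

print(f"critical elimination threshold: {threshold:.3f}")
print(f"no prior immunity: final size {final_size:.3f}, peak prevalence {peak:.3f}")
print(f"70% immune: final size {final_size_protected:.4f}")
```

With 70% of the population protected, which exceeds the threshold of about 67% for a basic reproduction number of 3, the effective reproduction number is below 1 and the introduced infection fades out instead of producing a large outbreak.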
epidemiological, treatment, and travel patterns)
and at different temporal and spatial scales. 


Concluding remarks 

Infectious diseases are an important frontier in 
public health, and their prevention and control 
call for global, rather than national or region- 
al, coordinated efforts (75-78). The success of 
smallpox and rinderpest eradication campaigns 
shows the possibilities; the global spread of new- 
ly emerged pathogens (recently avian influenza 
strains and MERS coronavirus), the difficulties 
in curbing the spread of antibiotic resistance, 
the upsurge of polio toward the “end-phase” of 
its eradication, and the recent unprecedented 
spread of Ebola virus, are examples that show 
the need for international coordination and 
collaboration. Nonlinearity in infectious dis- 
ease dynamics and global connectivity cause 
suboptimal national decisions on control and 
prevention to have regional and even global re- 
percussions. 

Given the mismatch with regions where most 
expertise on infectious disease dynamics is con- 
centrated, it is important to empower local sci- 
entists and policy-makers, in regions where the 
burden of disease is heaviest, about the problems 
facing their own countries and the consequenc- 
es of local actions. It is essential to make exper- 
tise, data, models, statistical methods, and soft- 
ware widely available by open access. There are 
several initiatives (e.g., Thiswormyworld.org, 
Garkiproject.nd.edu, EDENextdata.com, the 
Humanitarian Data Exchange (HDX), and the 
Malaria Atlas Project), but more needs to be 
done. Modeling tools and software for data anal- 
ysis are beginning to become open source, such 
that findings can be replicated, additional sce- 
narios can be evaluated, and others can incor- 
porate methods for data analysis or simulation. 
Ultimately, sharing models guarantees more 
reproducible results, while maximizing model 
transparency. 

Making data sets widely available is also cru- 
cial, for example, to support replication of find- 
ings and broader comparative analyses (79). As 
models become open access, so should much 
of the data collected by governments, inter- 
national agencies, and epidemiology research 
groups. Two outbreaks never occur in exact- 
ly matching circumstances, even for the same 
infectious agent, so there is potential to study 
many outbreaks in parallel to gain insight into 
the determinants of outbreak pattern and sever- 
ity. Looking forward, there is a major opportu- 
nity to design experiments, clinical trials [for 
example for vaccines (80)], and surveillance 
protocols to test model predictions or assump- 
tions, and to help reduce or better target the 
enormous costs involved. By integrating model- 
ing approaches throughout the full life cycle of 




infectious disease policies, including economic 
considerations (58, 70, 81), health outcomes can 
be improved and scientific understanding can 
be advanced. 

At present, the evidence provided by infec- 
tious disease models is not considered by the 
Grading of Recommendations Assessment, De- 
velopment and Evaluation (GRADE) working 
group (www.gradeworkinggroup.org) alongside 
that of conventional studies such as clinical tri- 
als. Regardless, models are essential when di- 
verse sources of data (including GRADE-scale 
evidence) need to be combined and weighed to 
assess quality of evidence and recommendations 
made in health care scenarios. In many cases, 
the definitive trial cannot be performed, and in 
such circumstances models can offer insight and 
extract maximum value from data that are avail- 
able. In recent years, uniformity of practice and 
quality control for models has received more 
attention, resulting in initial attempts to char- 
acterize good modeling practice for infectious 
diseases (82, 83). 

The optimal use of models to inform policy 
decisions requires a continuous dialogue be- 
tween the multidisciplinary infectious disease 
dynamics community and decision-makers. 
This is increasingly understood by governments 
in developed countries, in nongovernmental 
agencies and by large funding bodies. This di- 
alogue will help to reduce the burden from in- 
fectious diseases by providing better-informed 
control strategies. Mathematical models will 
allow us to capitalize on new data streams and 
lead to an ever-greater ability to generate robust 
insight and collectively shape successful local 


and global public health policy. 
Computer scan uncovers 100,000 new viruses
Clues to future outbreaks may be hidden in existing genomic databases 


By Elizabeth Pennisi 


It took just one virus to cripple the world’s
economy and kill millions of people; yet
virologists estimate that trillions of still-unknown
viruses exist, many of which might
be lethal or have the potential to spark the
next pandemic. Now, they have a new (and
very long) list of possible suspects to interrogate.
By sifting through unprecedented amounts 
of existing genomic data, scientists have 
uncovered more than 100,000 novel virus- 
es, including nine coronaviruses and more 
than 300 related to the hepatitis Delta virus, 
which can cause liver failure. 

“It’s a foundational piece of work,” says 
J. Rodney Brister, a bioinformatician at the 
National Library of Medicine. The study, 
published last week in Nature, expands the 
number of known viruses that use RNA
instead of DNA for their genes by an order
of magnitude. It “demonstrates our outra- 
geous lack of knowledge about this group 
of organisms,” says disease ecologist Peter 
Daszak, president of the EcoHealth Alli- 
ance, a nonprofit research group in New 
York City that is raising money to launch a 
global survey of viruses. 

Scientists predict the study will also 
help launch so-called petabyte genomics:
the analysis of previously unfathomable
quantities of DNA and RNA data. (One
petabyte is 10^15 bytes.) That wasn’t exactly what
computational biologist Artem Babaian had in 
mind when he came up with the project while
in between jobs in early 2020. Instead, he was 
simply curious about how many coronavirus- 
es—aside from the virus that had just launched 
the COVID-19 pandemic—could be found in 
sequences in existing genomic databases. 

So, he and independent supercomputing
expert Jeff Taylor scoured cloud-based genomic
data that had been deposited to a global
sequence database and uploaded by the U.S.
National Institutes of Health. As of now, the
database contains 16 petabytes of archived sequences,
which come from genetic surveys of
everything from fugu fish, the risky Japanese
delicacy, to farm soils to human guts. (A database
with a 5-megabase digital photo of every
person in the United States would take up about
the same amount of space.) The sequences also
capture the genomes of viruses infecting different
organisms in samples, but the viruses
usually go undetected.

In a vast repository of genetic sequences, scientists found nine
unknown coronaviruses, relatives of SARS-CoV-2 (computer model).

To sift through the reams of data, 
Babaian and Taylor devised a set of com- 
puter search tools specialized for cloud- 
based data. With the help of several 
bioinformaticians, some of whom became
collaborators on the project, they tweaked 
the new software to make their analysis 
“way faster than anyone thought possible,” 
recalls Babaian, who is now at the Univer- 
sity of Cambridge. 

They soon expanded the viral hunt 
beyond coronaviruses and looked at all 
the data in the cloud. Babaian and his 
colleagues’ programs hunted among the 
cloud’s sequences for matches to the cen- 
tral core of the gene for RNA-dependent 
RNA polymerase, which is key to the rep- 
lication of all RNA viruses. Such viruses 
include not only coronaviruses, but also 
those that cause flu, polio, measles, and 
hepatitis. 

Babaian’s approach was fast enough to work
through 1 million data sets a day, at a computing
cost of less than 1 cent per data set. “It’s
an impressive engineering feat,” says C. Titus
Brown, a bioinformatician at the University of
California, Davis. When the researchers were
finally finished, they had uncovered the partial
genomes of almost 132,000 RNA viruses.

The group’s new database doesn’t have the
complete sequence of each new virus; in many
cases, there’s just the gene for the core enzyme.
But researchers can use even partial sequences 
to build family trees that reveal how different vi- 
ruses are related. In some cases, they can also use 
the database to find out where around the world 
a particular virus was found—and what type of 
host it was in. And some of the discovered virus- 
es could help researchers better understand how 
human pathogens arise, Brown says, or improve 
diagnostic tests for infections. Finally, when a
new virus is isolated from a sick patient, a scan
of the genomic database could show whether it
was already present elsewhere. “We have turned
this [database] into a giant virus surveillance
network,” Babaian says.

Some findings were unexpected, including 
new coronaviruses in the well-studied fugu 
fish and in the axolotl, an amphibian that is a 
common lab organism. In a few cases, research- 
ers could piece together whole genomes for the 
viral finds. And in some aquatic animals, those 
sequences suggested their novel coronavirus ge- 
nomes are spread across two separate RNA mol- 
ecules, not the usual single strand, Babaian and 
his colleagues report. 

Babaian’s team also came across evidence of
more than 250 giant bacteriophages (viruses
that infect bacteria) that resemble ones already
known in algae. These “huge phages” were detected
in sequences from vastly different organisms.
One group of huge phages was found in a
person in Bangladesh and also in cats and dogs
in the United Kingdom, for example. These viruses
are big enough to carry genes between different
host species, suggesting they might provide
a new source of genetic changes, Babaian
notes. That’s the way it is with viruses, Daszak
says. “Every time we start digging, we get surprises.”

To make sure others can take advantage of
the work, Babaian’s team has created a public
repository of the tools it developed, along with
the results. The amount of cloud-based, publicly
available DNA sequences is expanding
exponentially; if he did the same analysis next
year, Babaian expects he would find hundreds
of thousands more RNA viruses. “By the end of
the decade, I want to identify over 100 million.”


A highly virulent variant of HIV-1

We discovered a highly virulent variant of subtype-B HIV-1 in the Netherlands. One hundred nine
individuals with this variant had a 0.54 to 0.74 log10 increase (i.e., a ~3.5-fold to 5.5-fold increase)
in viral load compared with, and exhibited CD4 cell decline twice as fast as, 6604 individuals with
other subtype-B strains. Without treatment, advanced HIV (CD4 cell counts below 350 cells per
cubic millimeter, with long-term clinical consequences) is expected to be reached, on average, 9
months after diagnosis for individuals in their thirties with this variant. Age, sex, suspected mode
of transmission, and place of birth for the aforementioned 109 individuals were typical for HIV-positive
people in the Netherlands, which suggests that the increased virulence is attributable to
the viral strain. Genetic sequence analysis suggests that this variant arose in the 1990s from de
novo mutation, not recombination, with increased transmissibility and an unfamiliar molecular
mechanism of virulence.


The risk posed by viruses evolving to
greater virulence (i.e., causing greater
damage to their hosts) has been extensively
studied in theoretical work despite
few population-level examples (1-3).

The most notable recent example is the 
B.1.617.2 lineage (Delta variant) of severe acute 
respiratory syndrome coronavirus 2 (SARS- 
CoV-2), for which an increased probability 
of death has been reported (4-6), as well as 
increased transmissibility (7, 8). RNA viruses 
have long been a particular concern, as their 
error-prone replication results in the greatest 
known rate of mutation—and thus high po- 
tential for adaptation. Greater virulence could 
benefit a virus if it is not outweighed by re- 
duced opportunity for transmission. These 
antagonistic selection pressures may result in 
an intermediate level of virulence being opti- 
mal for viral fitness, as observed for HIV (9). 
Concrete examples of such evolution in action, 
however, have been elusive. Continued moni- 
toring of HIV virulence is important for global 
health: 38 million people currently live with the 
virus, and it has caused an estimated 33 million
deaths (www.unaids.org). 

The main (M) group of HIV-1, responsible 
for the global pandemic, first emerged around 
1920 in the area of what is now Kinshasa, Dem- 
ocratic Republic of the Congo (10), and had 
diversified into subtypes by 1960 (11). The 
subtypes, and the most common circulating 
recombinant forms (CRFs) between the sub- 
types, took different routes for global spread, 
establishing strong associations with geogra- 
phy (12), ethnicity, and mode of transmission. 
Differences in virulence between subtypes and 
CRFs have been reported, though it is challeng- 
ing to disentangle genotypic effects on virulence 
from confounding effects while retaining large 
sample sizes, given the strong associations be- 
tween viral, host, and epidemiological factors 
(13). The co-receptor used for cell entry has 
long been understood to affect virulence (14, 
15), and this has been proposed as a mechanism 
that underlies differences in virulence between 
subtypes and CRFs (13), as well as one reported 
difference within a CRF (16). 

HIV-1 virulence is most commonly mea- 
sured by viral loads (the concentration of viral 
particles in blood plasma) and CD4 counts (the 
concentration of CD4+ T cells in peripheral 
blood, which tracks immune system damage 
by the virus). Successful treatment with anti- 
retroviral drugs suppresses viral load and inter- 
rupts the decline in CD4 counts that would oth- 
erwise lead to AIDS. Both viral load and rate of 
CD4 cell decline are heritable properties—that 
is, these properties are causally affected by viral 
genetics, leading to correlation between an indi- 
vidual and whomever they infect (17-21). It has 
therefore been expected that viral load and CD4 
cell decline could change with the emergence 
of a new viral variant. We substantiate that ex- 
pectation with empirical evidence by reporting 
a subtype-B variant of HIV-1 with exceptionally 
high virulence that has been circulating within 
the Netherlands during the past two decades. 


Discovery of the highly virulent variant 

Within an ongoing study (the BEEHIVE project; 
www.beehive.ox.ac.uk), we identified a group of 
17 individuals with a distinct subtype-B viral 
variant, whose viral loads in the set-point win- 
dow of infection (6 to 24 months after a positive 
test obtained early in the course of infection) were 
highly elevated (Table 1, middle column). BEE- 
HIVE is a study of individuals enrolled in eight 
cohorts across Europe and Uganda, who were 
selected because they have well-characterized 
dates of infection and samples available from 
early infection, for whom whole viral genomes 
were sequenced. The 17 individuals with the dis- 
tinct viral variant comprised 15 participants in 
the ATHENA study in the Netherlands, 1 from 
Switzerland, and 1 from Belgium. See materials
and methods for details on the initial discovery. 


Replication of the discovery in Dutch 
ATHENA data 

To replicate the finding and to investigate this 
viral variant in more detail, we then analyzed 
data from 6706 participants in ATHENA with 
subtype-B infections (expanding on the subset 
of 521 participants in ATHENA who were eli- 
gible for inclusion in BEEHIVE). We found 92 
additional individuals infected with the viral 
variant, bringing the total to 109 such individu- 
als in either dataset. When replicating the BEE- 
HIVE test with the ATHENA data (Table 1, right 
column), we again observed a large rise in viral
load in individuals with this viral variant: an increase
of 0.54 log10 viral copies/ml (i.e., a ~3.5-fold
increase). The effect size was the same in a
linear model including age at diagnosis and sex
as covariates, and persisted in newly diagnosed
individuals over time (Fig. 1A). Henceforth, for
brevity, we refer to this viral variant as the “VB
variant” (for virulent subtype B), to individuals
infected with this variant as “VB individuals,”
and to individuals infected with a different
strain of HIV as “non-VB individuals.”
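The fold increases quoted for viral load follow directly from the differences on the log10 scale. The short Python sketch below makes the conversion explicit, using the 0.54 log10 difference reported for the ATHENA data and the mean set-point viral loads of 5.10 and 5.84 log10 copies per milliliter listed for the BEEHIVE data in Table 1; the function and variable names are illustrative only.

```python
def log10_difference_to_fold(log10_difference):
    """Convert a difference in log10 viral load into a fold change."""
    return 10 ** log10_difference


# ATHENA replication: difference reported directly on the log10 scale.
athena_fold = log10_difference_to_fold(0.54)

# BEEHIVE discovery: difference between the mean log10 viral loads in Table 1.
beehive_difference = 5.84 - 5.10
beehive_fold = log10_difference_to_fold(beehive_difference)

print(f"ATHENA:  0.54 log10 difference is a {athena_fold:.1f}-fold increase")
print(f"BEEHIVE: {beehive_difference:.2f} log10 difference is a "
      f"{beehive_fold:.1f}-fold increase")
```

Because the means are taken on the log10 scale, these fold changes compare geometric rather than arithmetic mean viral loads.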


Search for closely related viruses 

To test whether the variant was more widely 
disseminated, we searched publicly available 
databases for similar HIV viral genotypes. All 
results had <95% sequence similarity to a repre- 
sentative viral sequence for the variant. Of the 17 
VB individuals originally found in BEEHIVE, 
one was from the Swiss HIV Cohort Study (22) 
(SHCS). By examining previously published 
data (23), we found that three other individuals 
from the SHCS were closely related (a phyloge- 
netic distance below 2.5%). The high coverage of 
the Swiss HIV Cohort [including 89% of report- 
ed new infections from 2009 through 2018, with 
~65% of the cohort sequenced (24)] makes it 
unlikely that many more VB individuals in Swit- 
zerland were undetected. Data to assess viral 
load or CD4 cell decline for these three individ- 
uals were not available, owing to early initiation 
of treatment. 


More-rapid CD4 cell decline 

At the time of diagnosis, CD4 counts for VB individuals
were already lower than for non-VB
individuals by 73 cells/mm³ [95% confidence
interval (CI): 12 to 134]. These counts subsequently
declined faster, by a further 49 cells/mm³
per year (CI: 20 to 79), in addition to the
decline for comparable non-VB individuals [49
cells/mm³ per year (CI: 46 to 51) for men diagnosed
at the age of 30 to 39 years]. The VB
variant is therefore associated with a doubling in
the rate of CD4 cell decline. These values are averages
estimated by using a linear mixed model
adjusted for sex and age at diagnosis. Figure 1B
illustrates the CD4 count decline that would be
expected if disease progression were to continue
linearly in the absence of treatment. Initiating
treatment at a CD4 count of 350 cells/mm³, instead
of immediately, was previously shown to
substantially increase the subsequent hazard for
serious adverse events (25). As seen in Fig. 1B,
this stage of CD4 cell decline is expected to be
reached in 9 months (CI: 2 to 17) from the time
of diagnosis for VB individuals, as opposed to 36
months (CI: 33 to 39) for non-VB individuals, in
males diagnosed at the age of 30 to 39 years. It is
reached even more quickly in older age groups,
for which we found progressively lower CD4
counts at time of diagnosis (table S1). At a CD4
count of 200 cells/mm³, there is a high risk of
immediate AIDS-related complications; without
treatment this stage of decline would be reached,
on average, between 2 and 3 years after diagnosis
for VB individuals and between 6 and 7 years
after diagnosis for comparable non-VB individuals
[the latter being similar to previous reports
in Europe (26)].
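The times quoted in this paragraph can be checked against one another with a simple linear projection. The Python sketch below uses only the point estimates given in the text (confidence intervals are ignored). The CD4 count at diagnosis for non-VB individuals is not stated in this section, so it is back-calculated here from the reported 36 months to reach 350 cells/mm³; that value, like the function name, is an assumption made for illustration.

```python
NON_VB_DECLINE = 49.0           # cells/mm^3 per year (from the text)
VB_EXTRA_DECLINE = 49.0         # additional decline for VB individuals (from the text)
VB_DEFICIT_AT_DIAGNOSIS = 73.0  # cells/mm^3 lower at diagnosis (from the text)

# Assumption: baseline back-calculated so that non-VB individuals reach
# 350 cells/mm^3 exactly 36 months (3 years) after diagnosis.
NON_VB_CD4_AT_DIAGNOSIS = 350.0 + NON_VB_DECLINE * 3.0
VB_CD4_AT_DIAGNOSIS = NON_VB_CD4_AT_DIAGNOSIS - VB_DEFICIT_AT_DIAGNOSIS
VB_DECLINE = NON_VB_DECLINE + VB_EXTRA_DECLINE


def years_to_threshold(cd4_at_diagnosis, decline_per_year, threshold):
    """Years until a linearly declining CD4 count reaches the threshold."""
    if cd4_at_diagnosis <= threshold:
        return 0.0
    return (cd4_at_diagnosis - threshold) / decline_per_year


non_vb_months_350 = 12 * years_to_threshold(NON_VB_CD4_AT_DIAGNOSIS, NON_VB_DECLINE, 350)
vb_months_350 = 12 * years_to_threshold(VB_CD4_AT_DIAGNOSIS, VB_DECLINE, 350)
non_vb_years_200 = years_to_threshold(NON_VB_CD4_AT_DIAGNOSIS, NON_VB_DECLINE, 200)
vb_years_200 = years_to_threshold(VB_CD4_AT_DIAGNOSIS, VB_DECLINE, 200)

print(f"months to 350 cells/mm^3: VB {vb_months_350:.1f}, non-VB {non_vb_months_350:.1f}")
print(f"years to 200 cells/mm^3:  VB {vb_years_200:.1f}, non-VB {non_vb_years_200:.1f}")
```

Under these assumptions the projection gives about 9 months to 350 cells/mm³ for VB individuals, and times to 200 cells/mm³ that fall within the 2-to-3-year and 6-to-7-year ranges stated above, so the reported point estimates are mutually consistent.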

The effect of the VB variant on CD4 cell decline
remained after we adjusted for the effect of
higher viral load. With this adjustment, VB individuals
have a CD4 count at diagnosis as would
be expected given their high viral loads, but their
subsequent decline in CD4 counts is again twice
as fast as for comparable non-VB individuals
with high viral loads: their rate of decline is
accelerated by 44 cells/mm³ per year (CI: 16 to
72). Comparison of this additional decline with
that expected from a +1 increase in log10 viral
load, 15 cells/mm³ per year (CI: 11 to 18), shows
that the variant’s effect on CD4 count decline is
equivalent to that expected from a +3.0 increase
in log10 viral load. The same analysis of measurements
of CD4 percentages (the percentage of all
T cells that express CD4) showed that these also
declined twice as fast for VB individuals, and
again this doubling in speed of decline remained
when we adjusted for the higher viral load of the
variant (table S2 and fig. S1).


No difference in CD4 cells after treatment, or 
in mortality 

Measurements of treatment success include 
CD4 cell recovery and mortality. CD4 counts 
and percentages after treatment initiation were 
similar for VB and non-VB individuals, as mea- 
sured with both linear mixed modeling of the 
CD4 dynamics (tables S3 and S4 and fig. S2) and
an individual-matching procedure. The hazard 
for death (from any cause) was also similar: VB 
individuals had a relative hazard of 1.4 (CI: 0.7 to 
2.8, P = 0.35, Cox proportional hazards model). 
Our study had statistical power to detect only 
very large differences in mortality, as reflected 
in the wide CI for relative hazard for death and
shown in Fig. 1C. VB individuals had similar 
CD4 counts and mortality after treatment de- 
spite a faster CD4 cell decline before treatment; 
this could be explained by their tendency to 
start treatment sooner after diagnosis (fig. S3).
For example, although the probability of having 
started treatment was estimated to be similar 6 
months after diagnosis [42% (CI: 41 to 44%) for 
non-VB individuals compared with 46% (CI: 35 
to 54%) for VB individuals], it was different 2 
years after diagnosis [65% (CI: 64 to 67%) for 
non-VB individuals and 93% (CI: 85 to 96%) for 
VB individuals]. Had VB individuals not started 
treatment earlier than others, lower CD4 counts 
at treatment initiation would have been expect- 
ed, potentially causing increased morbidity and 
mortality (25). This information could be rele- 
vant if VB or variants like it are found in settings 
with less widespread availability of HIV care. 


Characteristics of individuals infected with 
the VB variant 

VB individuals were mostly (82%) men who 
have sex with men, similar to non-VB individ- 
uals (76%). Age at diagnosis was also similar for 
VB and non-VB individuals (fig. S4). Neither
ethnicity nor host genotype data were available, 
but the place of birth was mostly recorded as 
Western Europe for both groups (71% for non- 
VB individuals, 86% for VB individuals). VB in- 
dividuals were present in all regions of the Neth- 
erlands, but with a different distribution relative 
to that of non-VB individuals (N = 102 versus 
6604 individuals, P < 10-’, simulated Fisher’s exact
test): VB individuals were more common in
the south (25% of VB individuals versus 6% of 
non-VB individuals) and less common in Am- 
sterdam (20% versus 51%), as shown in table 
S5. Table S6 lists the hospitals included in each 
region. The average time from infection to diag- 
nosis, for men who have sex with men in this co- 
hort diagnosed in the late 2000s, was previously 
estimated to be 3.6 years (CI: 3.3 to 4.0) (27). 


Genotype of the VB variant 

Sequence data from the BEEHIVE proj- 
ect are whole-genome data, providing the 
17 whole genomes available for the variant; 
sequence data from ATHENA are partial pol 
gene data only, available for the additional 92 
VB individuals. We subtyped the 17 whole ge- 
nomes for the variant as pure subtype B [with 
100% support from two concordant methods 
(28, 29)], like most HIV-1 in the Netherlands. 
We predicted co-receptor usage from the 17 
whole genomes using two concordant methods
(30, 31): one was likely CXCR4-tropic; the
other 16 were likely CCR5-tropic. Only one 
drug-resistance mutation was common for the 
VB variant: Met41→Leu (M41L), present in
91 of 109 partial pol gene sequences. Without 
other linked resistance mutations, M41L caus- 
es only low-level resistance to zidovudine (32, 
33). Two of the whole genomes were found to 
be recombinants between the VB variant and 
another subtype-B cluster in ATHENA (con- 
taining a small amount of sequence from the 
latter) and were excluded from subsequent 
sequence analysis. Among whole genomes 


in BEEHIVE and all whole genomes in the 
Los Alamos National Laboratory HIV Data- 
base (www.hiv.lanl.gov), none appeared to 
be a candidate for a “recombination parent” 
of the VB variant—i.e., the many mutations 
that distinguish the VB variant from any oth- 
er known virus appear to have arisen de novo, 
not through recombination. 

We compared the consensus sequence for 
the VB variant with the consensus of all Dutch 
subtype-B sequences in BEEHIVE, at both the 
amino acid and the nucleotide level: There 
were 250 amino acid changes and 509 nucleo- 
tide changes, as well as insertions and deletions. 
These alignments are included as data S1, and 
the amino acid alignment is illustrated in fig. S5. 
The distribution of nucleotide changes over the 
genome is in line with expectations (for exam- 
ple, fewer in the conserved pol gene region and 
more in the variable env gene region; see fig. S6). 
The VB-variant genotype is thus characterized 
by many mutations spread through the genome, 
meaning that a single genetic cause for the en- 
hanced virulence cannot be determined from 
the current data. 
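
The consensus comparison described above can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: it assumes sequences that are already aligned to equal length, uses a simple majority-rule consensus, and the function names and toy sequences are hypothetical.

```python
from collections import Counter


def consensus(seqs):
    """Majority-rule consensus of equal-length aligned sequences."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # For each alignment column, keep the most frequent character.
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*seqs))


def count_changes(a, b, gap="-"):
    """Return (substitutions, gap-involving differences) between two aligned sequences."""
    subs = sum(1 for x, y in zip(a, b) if x != y and gap not in (x, y))
    indels = sum(1 for x, y in zip(a, b) if x != y and gap in (x, y))
    return subs, indels


# Toy data (hypothetical): a small "variant" clade and a background set.
variant = ["ATGCA", "ATGCA", "ATGTA"]
background = ["ACGCA", "ACG-A", "ACGCA"]

variant_cons = consensus(variant)
background_cons = consensus(background)
print(variant_cons, background_cons, count_changes(variant_cons, background_cons))
```

The same two functions apply unchanged to amino acid alignments, since they only compare characters column by column.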

We conducted descriptive analyses of the 
mutations that distinguish the VB variant 
from the Dutch subtype-B consensus. All of 
the amino acid-level changes are listed in data 
S2 with annotations. Of the observed amino 
acid substitutions, 30 were previously shown 
to be positively associated with escape from 
cytotoxic T lymphocyte (CTL) response for 
at least one human leukocyte antigen type, 
and 13 were shown to be negatively associated 


Table 1. Comparison of viral loads between individuals infected with the VB viral variant and other individuals. When analyzing the viral loads of
individuals in the ATHENA study, we first excluded individuals who were in BEEHIVE, so that the test would be independent of the initial finding within the
BEEHIVE study. After our statistical tests of viral load, we did not exclude BEEHIVE individuals from the ATHENA data for subsequent analyses. N, number of
individuals after those without viral load measurements before treatment were excluded; IQR, interquartile range.

Test: Discovery [BEEHIVE dataset (Europe)]
Viral load measurements compared: Set-point viral loads for N = 15 VB individuals and N = 2446 individuals with any other HIV-1 strain
Mean and IQR of viral load in non-VB individuals, in log10 copies per milliliter: 5.10 (IQR: 4.69 to 5.58)
Mean and IQR of viral load in VB individuals, in log10 copies per milliliter: 5.84 (IQR: 5.57 to 6.09)
P value for increase: […] (two-tailed t test, significant at a level of 5 × 10^[…] when Bonferroni-corrected for performing 50 such tests)

Test: Replication [ATHENA dataset (Netherlands), excluding overlap with BEEHIVE]
Viral load measurements compared: Mean pretreatment viral loads for N = 91 VB individuals and N = 5272 individuals with any other subtype-B HIV-1 strain
Mean and IQR of viral load, in log10 copies per milliliter (non-VB, then VB): 4.79 […] 5.38) […]
P value for increase: 1 × 10^[…] (one-tailed t test)


Fig. 1. Clinical characteristics of VB individuals. Those infected with the highly
virulent variant (VB individuals) are represented in red; those infected with
any other subtype-B virus (non-VB individuals) are shown in blue. (A) Box-and-
whisker plots of viral load, by year of diagnosis. Diagnosis dates were grouped
to produce boundaries that coincide with years and roughly equal numbers
of VB individuals (39 in 2002-2006, 35 in 2007-2008, and 27 after 2008; the
pattern is robust to other groupings). (B) Expected decline in CD4 count in the

(34). To provide context for these numbers, 
within Dutch subtype-B data in BEEHIVE 
we defined 16 other clades that are similar to 
the lineage in size (see materials and meth- 
ods). For each clade, we calculated the ami- 
no acid consensus sequence, compared this 
to the Dutch subtype-B overall consensus, 
and determined CTL escape mutations. This 
showed that the number of such mutations for 
the VB variant is typical when normalized by 
its overall level of divergence (fig. S11). We
also calculated the ratio of rates of nonsynonymous
and synonymous changes (dN/dS) for
each gene, for the VB variant and the other
16 Dutch subtype-B clades used for comparison.
The VB variant had lower dN/dS values
than all of the other clades for env, pol, and
tat, though its values were not extreme; for the
other genes, its dN/dS value was in the range
spanned by the other clades (fig. S12). Finally,
at codon position 77 of the protein Vpr, the 
consensus of all Dutch subtype-B sequences 
in BEEHIVE is glutamine, whereas the VB 
consensus is arginine. Glutamine was previ- 
ously found to be more common in long-term 
nonprogressors, and mutation to arginine in- 
creased T cell apoptosis in vitro and strong- 
ly increased T cell decline in mouse models 
(35). However, both alleles have been com- 
monly observed in subtype B to date (of 2178 
subtype-B Vpr protein sequences in the Los 
Alamos National Laboratory HIV Database, 
52% have glutamine and 36% have arginine), 
making it implausible that this mutation alone 
is the dominant mechanism for the virulence 
effect we observed. 


Evolution of the VB variant

The maximum-likelihood phylogeny in Fig. 2A 
shows the VB variant in the context of back- 
ground sequences, demonstrating that it is a 
distinct genetic cluster characterized by high 
viral loads. The phylogeny was inferred from 
15 whole-genome VB-variant sequences and 
100 randomly chosen whole-genome subtype-B 
background sequences from BEEHIVE. Figure 
2B shows a dated phylogeny for VB-variant se- 
quences only, estimated by using BEAST (36) and 
partial pol sequences. This phylogeny is colored 
by region, inferred with an ancestral state recon- 
struction by parsimony (minimizing changes of 
region). Amsterdam was assigned to the most 
recent common ancestor in 97% of trees in the 
posterior, showing that this reconstruction was 
robust to the uncertainty in the phylogeny. All 
VB-variant sequences date from 2003 onward; 
the time of their most recent common ancestor 
(TMRCA) was estimated as 1998.0 (95% credi- 
bility interval: 1995.7 to 2000.1). Trees were vi- 
sualized by using ggtree (37). 
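
To make the parsimony step concrete, the following sketch applies the bottom-up pass of Fitch's algorithm to a toy tree. It is only an illustration under stated assumptions: the source says regions were reconstructed by parsimony but does not name the algorithm or software, the tree shape and the "South" and "West" labels are hypothetical, and the tree is assumed to be strictly binary.

```python
def fitch(node):
    """Bottom-up Fitch pass on a binary tree of nested 2-tuples.

    Leaves are state strings. Returns (candidate_states, minimum_changes)
    for the subtree rooted at `node`.
    """
    if isinstance(node, str):
        return {node}, 0
    left, right = node
    left_states, left_cost = fitch(left)
    right_states, right_cost = fitch(right)
    shared = left_states & right_states
    if shared:
        # Children agree on at least one state: no extra change is needed.
        return shared, left_cost + right_cost
    # Children disagree: keep all candidates and count one change.
    return left_states | right_states, left_cost + right_cost + 1


# Toy tree (hypothetical): tips labeled by region of diagnosis.
tree = ((("Amsterdam", "Amsterdam"), "South"), ("Amsterdam", "West"))
root_states, changes = fitch(tree)
print(root_states, changes)
```

Repeating such a reconstruction over every tree in a posterior sample and tallying the root assignment gives a percentage of the kind reported above.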


Phylodynamics of the VB variant 

The effective population size (Ne) of a pathogen
is indicative of the number of infectious people.
For the VB variant, Ne was estimated by using
a sky grid demographic model (38) in BEAST
and is shown in Fig. 2C (scaled by the coalescent
generation time τ). Ne increased until roughly
2010; after this, there is more uncertainty but
a possible downward trend [which may be an
artefact of Ne inference methods in the recent
past (39)]. The proportion of VB-variant cases 
among all new subtype-B cases increased un- 
absence of treatment. The model was adjusted for sex and age at diagnosis;
values shown are for males diagnosed at the age of 30 to 39 years. Shaded regions
indicate 95% CIs in the model's prediction of mean values, given the uncertainty
in estimation of parameter values (it does not reflect the variability between
individuals in each of the two groups, which is much greater). The dashed black
line denotes a CD4 count of 350 cells/mm³ (see text for details). (C) Probability
of still being alive at a given time after diagnosis.


til a peak in 2008 and subsequently decreased, 
though again with appreciable uncertainty [ab- 
solute numbers of both VB and non-VB diag- 
noses in our dataset have been decreasing since 
roughly 2008, and the data are right-censored by 
several years (fig. S7)]. In a recent analysis of an 
updated version of the ATHENA dataset (40), 
33 additional VB individuals were found, which 
suggests that VB diagnoses were stable until 
roughly 2013 and have since been declining, still 
with appreciable uncertainty. 

We calculated the local branching index 
(LBI), which is a measure of fitness (41). For 
HIV in a context in which most individuals start 
treatment without long delays, the LBI is closely 
related to transmissibility (see supplementary 
text). Compared with that of other transmission 
clusters, the LBI was higher for the VB variant 
both in BEEHIVE (P = 2 x 10°’) and ATHENA 
(P <2 x 10; fig. S8). High pretreatment trans- 
missibility may explain why the VB variant grew 
to be the 10th largest of 1783 clusters in the full 
ATHENA tree. 


Tree imbalance and evolution within the 
VB-variant clade 

We found nothing unusual in the extent to 
which the VB variant’s phylogeny is imbalanced, 
nor did we detect any indication of further evo- 
lution of viral load within the variant’s clade 
(supplementary text and fig. S9). 


The first sampled VB individual 

We retrieved and sequenced two additional sam- 
ples from the VB individual who was diagnosed 
in 1992, 10 years before subsequent diagnoses 
of other VB individuals. Phylogenetic analysis
suggested that this individual was infected with
a virus that had evolved most of the way, but not
entirely, toward VB-variant viruses typical of later
dates (supplementary text and fig. S10). This
individual was diagnosed in Amsterdam, con- 
sistent with the aforementioned ancestral recon- 
struction of region. In the 10 years before this 
first VB diagnosis, the proportion of individuals 
diagnosed in the Netherlands for whom a viral 
sequence was available was roughly one-third. 
The proportion of those diagnosed or undiag- 
nosed would be smaller still. This means that the 
infector of the 1992 individual was most likely 
not sampled, and indeed two or three steps in 
the transmission chain could have been unsam- 
pled. The long phylogenetic branch leading to 
the 1992 individual could therefore represent 
between-host evolution, not necessarily within- 
host evolution in a single individual. 
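
The sampling argument can be illustrated with simple arithmetic. This sketch assumes, purely for illustration, that each link in a transmission chain is sampled independently with the same probability; the source states only that roughly one-third of diagnosed individuals had a sequence available and that the true fraction among all infected people is lower, so the values below are lower bounds on the chance of unsampled links. The function name is hypothetical.

```python
from fractions import Fraction


def prob_unsampled_chain(p_sampled, steps):
    """Probability that `steps` consecutive links are all unsampled,
    assuming independent sampling with probability `p_sampled` per link."""
    return (1 - p_sampled) ** steps


# One-third is an upper bound on the sampled fraction, per the text.
p = Fraction(1, 3)
for k in (1, 2, 3):
    print(k, prob_unsampled_chain(p, k))
```

Under these assumptions a single unsampled infector is more likely than not, and chains of two or three unsampled links remain plausible.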


Discussion 

Previous studies of the heritability of viral load 
and CD4 cell decline led us to expect that these 
properties could change with the emergence of 
a new variant of HIV-1. We provided strong ev- 
idence for this, discovering a virulent subtype-B 
variant (the VB variant) that has been circulat- 
ing in the Netherlands since the late 1990s. We 
characterized the variant’s genotype and evo- 
lutionary history, as well as its association with 
high viral loads, rapid decline of CD4 cells, and 
increased transmissibility. We found 109 indi- 
viduals with the variant (VB individuals) whose 
age, sex, suspected mode of transmission, and 
region of birth are all typical for people living 
with HIV in the Netherlands. This suggests that 
the observed association is causal: The increased 
virulence is a property of the virus rather than 
a confounding property of individuals in this 
transmission cluster. An absence of viral load 
evolution inside the clade of VB variants sug- 
gests that the increased virulence is a property of 
the whole clade and not a subset of it—i.e., that 
the virulence evolution occurred on the long 
phylogenetic branch that connects this clade to 
other known viruses. 

Deferring the initiation of treatment until the 
measurement of a CD4 count’s decline to <350 
cells/mm³ or the onset of AIDS, instead of immediate
treatment initiation upon diagnosis,
was previously shown to increase the subsequent 
hazard of serious AIDS-related events by a fac- 
tor of 3.6 (CI: 2.0 to 6.7) and of any serious event 
(including death) by a factor of 2.4 (CI: 1.6 to 
3.3) (25). This long-lasting immunological damage
justifies WHO's classification of 350 CD4
cells/mm³ as “advanced HIV.” Without treatment,
advanced HIV is expected to be reached
in only 9 months (CI: 2 to 17) from the time of 
diagnosis for VB individuals, compared with 36 


Fig. 2. Phylogenetic and phylodynamic analysis of the VB variant. (A) Whole-genome
maximum-likelihood phylogeny of 15 VB-variant sequences and 100 background
subtype-B sequences. The color of each circle indicates the individual's
viral load in log10 copies per milliliter. The inset scale bar shows
the branch length scale in units of substitutions per site. (B) Dated
maximum-clade-credibility tree for 107 partial pol gene sequences from the
VB variant. Colors indicate geographical regions (N, E, S, and W: north,
east, south, and west), which are known for the tips of the tree but are
otherwise inferred by ancestral state reconstruction. The gray violin
plot superimposed on the root node shows the posterior density for its
date (i.e., the TMRCA); 1994 contains overflow to earlier dates for clarity.
(C) Effective population size (Ne) (scaled by the coalescent generation
time τ) over time with 95% credibility intervals, with the same time axis
as in (B).


months (CI: 33 to 39) for non-VB individuals, 
in males diagnosed at the age of 30 to 39 years. 
Advanced HIV is reached even more quickly 
in older age groups, and there is considerable 
variation between individuals around these ex- 
pected values. Many individuals could therefore 
progress to advanced HIV by the time they are 
diagnosed, with a poorer prognosis expect- 
ed thereafter in spite of treatment. In practice, 
there is still substantial variation in the delay 
from becoming infected to starting treatment, 
making the VB variant a concern even in the 
high-awareness and highly monitored context of 
the Dutch HIV-1 epidemic. In contexts with less 
awareness and monitoring, in which diagnosis 
often occurs later in infection, the probability of
reaching advanced HIV before diagnosis would 
be even greater. 

Future in vitro investigations could more 
firmly establish the role of the viral genotype, 
and reveal an as-yet-unknown virulence mecha- 
nism at the molecular or cellular level. A higher 
replicative capacity of the virus might be ob- 
served, given the increased viral loads seen here. 
However, it is likely that there will be more to the 
virulence mechanism: The VB variant doubles 
the rate of CD4 cell decline, measured with both 
counts and T cell percentages, even after adjusting
for its higher viral load. This rate is equivalent
to the acceleration of CD4 degradation that
would be expected from a 3.0 log10 increase in viral
load, though we observed a 0.54 to 0.74 log10
increase. This means that the virulence normal- 
ized by the amount of virus—the “per-parasite 
pathogenicity” (42, 43), which for HIV is herita- 
ble (19)—is much higher for the VB variant. Us- 
ing two aforementioned methods, we predicted 
that, of the 17 whole genomes available, 16 use 
only the R5 co-receptor for cell entry, which is 
typical for subtype-B viruses in early infection 
(13). This finding suggests that the underlying 
virulence mechanism is distinct from the well- 
known effect of cell tropism (14, 15). 
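
For readers less used to log10 units, the comparison above converts to fold changes as follows. This is plain arithmetic on the figures quoted in the text; the function name is hypothetical.

```python
def fold_change(log10_difference):
    """Convert a difference in log10 viral load into a multiplicative fold change."""
    return 10 ** log10_difference


# Observed increase in viral load for the VB variant (log10 copies per milliliter).
observed_low = fold_change(0.54)
observed_high = fold_change(0.74)

# Increase that would be needed to explain the CD4 decline through viral load alone.
equivalent = fold_change(3.0)

print(f"observed: {observed_low:.1f}-fold to {observed_high:.1f}-fold")
print(f"equivalent: {equivalent:.0f}-fold")
```

The observed increase is therefore roughly 3.5-fold to 5.5-fold, whereas the CD4 decline corresponds to what a 1000-fold increase would be expected to produce, which is the gap the per-parasite pathogenicity argument rests on.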

Previous studies have reported population- 
wide increases (44, 45) and decreases (46) in 
virulence over time. Mixed results between indi- 
vidual studies [see (47) for a meta-analysis] can 
be attributed to differences in epidemic context 
(such as the dominant subtypes), statistical pow- 
er, and observational biases over time. Temporal 
virulence trends could also be due to changing 
confounders, such as a shift in which subpopu- 
lations are most affected, the stage of infection at 
time of diagnosis, or coinfections. We expand on 
these studies by resolving a change in virulence 
to an individual viral variant. 

The basic theory of an infectiousness-
virulence trade-off is that infectiousness and
virulence are linked (for example, by how fast a 
pathogen replicates in its host) and that selec- 
tion pressures favor intermediate values rather 
than extreme ones. If infectiousness is too low, 
the pathogen cannot be transmitted when its 
host contacts other hosts, but if virulence is too 
high, the host becomes too ill to have such con- 
tacts. In the case of HIV, the implication of this 
theory is that we would not expect highly virulent 
viruses to spread widely through a population 
in the absence of widespread treatment, because 
their hosts would progress to AIDS very quick- 
ly, limiting the opportunities for transmission 
(9). Most of the evolution that gave rise to the 
VB variant occurred before 1992, before effec- 
tive combination treatment was available. How- 
ever, our findings may stimulate further interest 
in whether widespread treatment shifts the balance
of the infectiousness-virulence trade-off toward
higher virulence, thus promoting the emergence 
and spread of new virulent variants. Previous 
modeling studies have investigated this idea for 
pathogens generally (48) and for HIV specifical- 
ly (49, 50). We discuss subtleties of the argument 
in the supplementary text, but our conclusion is 
that widespread treatment is helpful to prevent 
new virulent variants, not harmful. The absolute 
fitness of viral variants must be considered in 
addition to their relative fitness, and treatment 
reduces the total onward transmission over the 
course of one infection, regardless of virulence. 
Put simply, “viruses cannot mutate if they cannot 
replicate” (anonymous), and “the best way to stop 
it changing is to stop it” [Marc Lipsitch (51)]. Early
treatment also prevents CD4 cell decline from
leading to later morbidity and mortality; thus 
clinical, epidemiological, and evolutionary con- 
siderations are aligned. Our discovery of a highly 
virulent and transmissible viral variant therefore 
emphasizes the importance of access to frequent 
testing for at-risk individuals and of adherence to 
recommendations for immediate treatment initi- 
ation for every person living with HIV (https:// 
www.who.int/health-topics/hiv-aids). 


Pandemic defense: The key is global collaboration


Gavin Cloherty* 


As the world adapts to its third year of living with
COVID-19, one thing has become clearer than ever: 
Viruses are in a race with science, and scientists must 
find ways not just to keep pace but to move faster. We 
have now entered an era where epidemic and pan- 
demic threats have become the new normal. 

Scientists need creative and innovative ways to hunt, track, 
and discover future viral threats as well as those facing us 
today. From highly transmissible COVID variants to deadly flu 
strains and more, viruses move fast and evolve even faster. 
And so must we. 

It's a tall order, but success can be achieved through a 
combination of science and collaboration. 

This means that we must take the lessons we've learned 
from the current pandemic and translate them into action, 
which includes developing strong partnerships between 
government and private companies; creating lasting coa- 
litions of doctors, scientists, and researchers to watch and 
respond to outbreaks as they occur; and helping to train the 
next generation of epidemiologists. 

Creating these partnerships between the private sector, 
academia, and governments can ensure we are ready for the 
next disease that comes our way. With 10 million times more 


*Head of Infectious Disease Research and the Pandemic Defense Coalition, Abbott, 


Abbott Park, IL 


viruses on Earth than stars in the universe,¹ there’s no shortage
of pathogens that could cause the next public health crisis. 

During the COVID-19 pandemic, private companies 
stepped up to rapidly develop and manufacture tests. At the 
time of the original Omicron variant surge, there were nine 
at-home rapid COVID-19 tests approved by the U.S. Food 
and Drug Administration, and 375 million tests were
available in the market,² but the volume was still not enough to
meet household needs. It's critical that governments contin- 
ue to work with private companies to procure key materials, 
secure manufacturing capacity, and coordinate a public 
health response strategy to raise pandemic defenses. 

To be as efficient as possible, the public and private sec- 
tors will have to collaborate across continents and oceans, 
too. Viruses know no borders, so neither should our efforts to 
combat them. That's why Abbott, a global health care com- 
pany building on a nearly 30-year history in viral surveillance, 
formed the Abbott Pandemic Defense Coalition—a first-of- 
its-kind, industry-led global scientific, academic, and public 
health partnership dedicated to improving early detection 
and rapid response to viral threats. The coalition, compris- 
ing 14 scientific, academic, and public health organizations 
located around the world, is actively working to help prevent 
the next pandemic. It works with partners on the ground to 
track SARS-CoV-2, hepatitis, HIV, and other global viruses by 
analyzing millions of viral sequences and screening samples 
isolated from patients with countless unknown illnesses in
order to stay one step ahead of the next threat, so we can 
respond quickly and prevent widespread impact. 

If a potential threat is discovered, the sequence of the vi- 
ruses is published in a public database so that health officials 
and laboratories can work together to identify if it's a novel 
strain, or a virus that has been previously detected. 

That approach proved effective when the COVID-19 
Omicron variant popped up in South Africa late last year 
and again with new subvariants arising across the globe this 
year. Our partners in Africa shared variant sequences and 
tested samples, allowing Abbott to quickly confirm that our 
COVID-19 tests could detect it. 

When faced with a new viral threat, Abbott will develop 
diagnostic testing to assist in containment efforts. The goal 
is to anticipate viral threats before they occur, enabling rapid 
responses and diagnostic testing to track how these threats 
are emerging and prevent them from becoming pandemics. 

To fight the current pandemic from all sides, Abbott de- 
veloped and produced 14 types of COVID-19 tests for use in 
hospitals, laboratories, point-of-care facilities, and homes. 
Further, because we knew the virus would mutate, we imme- 
diately included SARS-CoV-2 in our surveillance program so 
that we could study, sequence, and understand how it was 
changing. 

Today, coalition partners are putting this model to the test 
by actively collecting, logging, and monitoring diverse patho- 
gens to keep pace with new and changing viruses in every 
region of the globe—from rainforests in Colombia, Brazil, and 
Thailand, to the expansive savannas of South Africa, to the 
metropolitan areas of India and the United States. 

In just the past year, our coalition has identified numerous 
new lineages of the SARS-CoV-2 virus. Gaining insight into
the mutation rate over time, we've ensured that our mo- 
lecular, antigen, and serological tests can detect variants; 
studied immune systems after vaccination to better under- 
stand COVID-19 recovery; and discovered other undetected 
pathogens in circulation in member countries, alerting public 
health officials so that proper testing and monitoring can be 
put in place. 


As unknown viral threats become more common, we'll 
need more eyes on the ground. That's why we and the coa- 
lition are committed to growing and supporting the ranks of 
global virus hunters. Abbott is helping increase the number 
of future virus hunters through the coalition’s partnership 
with the Training Programs in Epidemiology and Public 
Health Interventions Network (TEPHINET), which is already 
training field epidemiologists in more than 100 countries. And 
we'll continue supporting scientists at each of our coalition 
sites to develop infrastructure and skills for future response. 

By expanding training programs, we can attract more virus 
hunters to be educated at research sites across the world 
and learn to respond more effectively in the fight against 
infectious diseases. A part of Abbott’s sustainability plan is to 
build capacity and scientific skills today—which will enable 
us to strengthen pandemic preparedness and protect com- 
munities against the threats of tomorrow. 

In an increasingly connected and interdependent world, it 
is imperative to streamline pandemic detection and re- 
sponse, improving links between international networks, 
governments, and public health organizations that are 
involved in global health decisions. 

As we make our way through yet another year of living 
with COVID-19, we have to keep taking steps to stop the 
next pandemic. The time to work together to stop future viral 
threats is now. And collaboration is key. 


¹https://www.smithsonianmag.com/air-space-magazine/there-are-more-viruses-earth-there-are-stars-universe-180974433.
²https://www.whitehouse.gov/briefing-room/statements-releases/2022/01/14/fact-sheet-the-biden-administration-to-begin-distributing-at-home-rapid-covid-19-tests-to-americans-for-free.


REPEATING ITSELF?


This first-of-its-kind* global network is dedicated 
to detecting and identifying viral threats. 

Meet the teams who are working around the 
world to help prevent the next pandemic. 


OVERVIEW


The Abbott Pandemic Defense Coalition connects organizations around the world that contribute to global health, scientific research 
and pandemic preparedness. Whether they’re conducting viral surveillance and discovery, analyzing genomic sequences, or training the 
next generation of epidemiologists and virologists, all partners play a vital role in staying a step ahead of infectious disease outbreaks. 


Rush University Medical Center
Rush is one of the leading academic health systems in the U.S. and works to discover 


novel treatments through research and clinical trials. 


Specialization: Clinical Research, Immunology, Informatics, Next-Generation 
Sequencing, Training and Education 


Universidade de Sao Paulo (USP) 


USP is a leading university and plays a fundamental role in advancing scientific, 
technological and social research in Brazil. 


Specialization: Clinical Research, Health Geography, Next-Generation 
Sequencing, Phylogenetics 


Colombia-Wisconsin One Health Consortium 

at the Universidad Nacional de Colombia 

University of Wisconsin—Madison and Universidad Nacional de Colombia strengthen 


University of the West Indies


The University of the West Indies aims to advance learning, create knowledge and 
foster innovation for the positive transformation of the Caribbean and the world. 


Fiocruz


Fiocruz, or the Oswaldo Cruz Foundation under the Ministry of Health, is the most 
prominent science and technology institution in health in Latin America. 
Sequencing, Phylogenetics, Training and Education, Virology


Université Quisqueya 


Université Quisqueya is a private education and research institution that is the first 
of its kind in Haiti. 


Uganda Virus Research Institute (UVRI)


UVRI conducts health research pertaining to human infections and diseases and enables 
Education, Virological Research


University of Zimbabwe 


This public research institution provides academic programs, research, and advisory 
services while developing strategic partnerships for innovative research, outreach and 


Centre for Epidemic Response and Innovation (CERI)


CERI uses its expertise in genomics to identify new variants of pathogens and to 
advance basic and translational science to improve public health efforts in Africa. 


Specialization: Bioinformatics, Genome Sequencing, Phylogenetics, Training 
and Education 


Institut de Recherche en Santé, de Surveillance Epidémiologique 
et de Formation (IRESSEF) 


An institution known for health research, IRESSEF aims to support public health policies 
to make healthcare accessible to African populations. 


Specialization: BSL-3, Epidemiological Surveillance, Clinical Research, Virus Discovery 


KwaZulu-Natal Research Innovation and Sequencing Platform (KRISP), 
Genomic Centre of the University of KwaZulu-Natal 


KRISP utilizes big data, bioinformatics and genomics analysis to understand the infectious 
diseases of today to better respond to future mutations or variants. 


Specialization: Bioinformatics, Genome Sequencing, Phylogenetics, Training 
and Education 


National Center for Disease Control and Public Health (NCDC)


The NCDC is Georgia’s central public health and research institution under the 
authority of the Ministry of Internally Displaced Persons (IDPs) from the Occupied 
Territories, Labour, Health and Social Affairs. 


Specialization: Clinical Research, Connecting Laboratories and Physicians, 
Countrywide Surveillance, Hepatitis Elimination Programs, Informatics 


Faculty of Medicine, Siriraj Hospital, Mahidol University 


Our partners at Siriraj Hospital, the leading medical school in the country, provide 
the latest medical services support and education while driving innovative infectious 
disease research. 


Specialization: Bioinformatics, Biorepository, Clinical Research, Diagnosis, 
Health Geography 


Y.R. Gaitonde Centre for AIDS Research and Education (YRG Care) 
Specialization: Clinical Research, Epidemiology, Patient Care Services for High-Risk


Protecting public health through viral
molecular surveillance 


Mary A. Rodgers* 


Pandemic Preparedness in a Changing World 


The continuous evolution and divergence of viral
pathogens, which are both fascinating and deadly, is
constantly threatening the advances we have made
in combating infectious diseases through diagnostics,
therapeutics, and vaccines. All these tools rely upon
viral sequence conservation to maintain effectiveness,
making them prone to failure when viruses mutate. The mag- 
nitude of the challenge of viral diversity is most notable in the 
case of highly divergent viruses such as human immunode- 
ficiency virus (HIV), hepatitis B virus (HBV), and hepatitis C 
virus (HCV), which have diversified into lineages that differ 
by up to 37.5% and are responsible for 38 million, 257 million, 
and 71 million chronic infections worldwide, respectively 
(1-4). With increasing global travel, immigration, and military 
deployments, the geographical footprint of these viruses is 
expanding (5, 6). Although SARS-CoV-2 has lower overall 
viral diversity than HIV, HBV, or HCV, the pandemic spread
of variants of concern around the world in a matter of weeks 
has repeatedly altered local transmission dynamics, mak- 
ing small changes in sequence a major driver of COVID-19 
illness (7). Therefore, the comprehensive spectrum of viral 
genetic diversity must be accounted for in the design and 
evaluation of diagnostics, therapeutics, and vaccines target- 
ing these and other divergent viruses. The consequences of 
an incomplete consideration of viral diversity include false 
negative diagnostic results; underquantification of viral load 
levels; vaccine breakthrough infections; treatment failures; 
and ultimately poor control of infectious diseases (8-16).
Molecular surveillance serves as the foundation for the 
robust design of diagnostics, therapeutics, and vaccines to 
effectively confront viral diversity. Complete characterization 
of the full diversity of pandemic viruses such as HIV, HBV, 
HCV, and SARS-CoV-2 requires a global effort, since even
one gap in the worldwide molecular surveillance system puts
the global population at risk. In addition to sequences from
surveillance programs, positive clinical specimens are also 
critically important for continuously monitoring the impact 
of viral diversity on diagnostic tests. These paramount needs 
were recognized over 28 years ago when the Abbott Glob- 
al Surveillance Program (AGSP) was established in 1994, 
with the goal of characterizing HIV diversity to improve the 
performance of diagnostic tests. The program grew out of 
decades of leadership in infectious disease diagnostics that 
began with Abbott's first test for HBV in 1972 and the first 
FDA-approved test for HIV in 1985. AGSP has since expand- 
ed to collecting more than 90,000 clinical specimens from 
46 different countries on six continents, through collabora- 
tions with hospitals, blood banks, research institutions, and 
public health agencies (17, 18). Access to the data generated
by the program has been democratized for the benefit of
all researchers through deposition of more than 5,000 new
viral sequences into public databases and the publication
of 152 peer-reviewed articles. Furthermore, AGSP-derived
sequences and specimens serve as the foundation for the 
development of infectious disease diagnostic assays that can 
tolerate viral diversity; these assays serve as our critical first 
line of defense in the efforts to contain and eliminate HIV, 
viral hepatitis, and SARS-CoV-2. As a key aspect of pan-
demic preparedness, AGSP serves as a pillar of the Abbott
Pandemic Defense Coalition.
Over the course of decades of molecular surveillance, 
AGSP has made several important discoveries in HIV 
research. A majority of the most divergent
HIV strains have been identified and se-
quenced by AGSP, including 11 out of 17
sequence-confirmed HIV Group N infections
identified to date (19-24) and one of the
only two Group P cases ever to have been
discovered (25). Numerous HIV-2 and HIV-1
Group O strains have also been characterized 
by AGSP, some of which differ from Group M 
sequences in the envelope gene—a primary 
target of the antibody response to HIV—by 
>50% (20, 26-29). More recently, the newest 
subtype of HIV Group M, subtype L, was es- 
tablished with the third genome sequenced 
by AGSP in 2019 (30). These diverse HIV 
strains illustrate the impact of a robust viral 
surveillance program on diagnostic assay 
development, since Abbott's ARCHITECT 
HIV Ag/Ab Combo and RealTime HIV-1 tests were designed
with awareness of viral diversity prior to the discoveries
of these strains, yet all of the strains were readily detected
without modification of the tests (27, 30).

AGSP has also brought new insights into the course
of the HIV pandemic (31). AGSP’s HIV sur-
veillance in Africa has uncovered an unexpected population
of potential HIV elite controllers in the Democratic Republic 
of Congo (DRC) who can control the virus naturally without 
suppression by antivirals (32). This group of patients came
to light in a viral surveillance study in which the frequency
of samples with detectable viral load was
unusually low in a seropositive HIV panel from Kinshasa.
Through systematic follow-up testing, AGSP eliminated 
diagnostic errors, divergent viruses, and antiretrovirals as 
potential explanations. While the frequency of HIV elite con- 
trollers is typically less than 0.5% of the population in other 
regions of the world, their prevalence in DRC was estimated 
to be 2.7%-4.4%, a figure that remained consistent over a 30- 
year period, suggesting that a unique capacity to control HIV 
infection arose early in the epicenter of the pandemic. These 
results serve as a spark for renewed investigation in this 
area to characterize the mechanisms by which potential elite 
controllers in DRC achieve viral suppression. 

After the addition of hepatitis virus surveillance into AGSP 
in the 2000s, several important contributions have been 
made in molecular surveillance of HBV, HCV, and hepatitis 
D virus (HDV) that have benefited public health. In a large 
study conducted with >13,000 participants in Cameroon,
where the prevalence of viremic HCV was previously only 
modeled and not measured, the molecular prevalence of 
HCV was determined to be 2.47%, identifying higher rates in 
males (3.81%) and persons over the age of 40 (9.09%) (33). 
In this same group, a high prevalence of HBV infections was 
found (10.45%), and, among these, the prevalence of HDV 
antibodies was 46.73% (20, 34). Remarkably, HDV RNA was 
found in 34.2% of all HBV+ cases, with the highest rates
found in children aged 2-17 years (34).

The sequences generated through molecular surveillance 
studies can also provide deeper insight into the transmission 
dynamics of a countrywide epidemic. For example, in a 2020 
AGSP molecular surveillance study conducted in a group of 
people who inject drugs (PWID) in five cities in India, 10 dif-
ferent HCV subgenotypes were present, whereas significantly less
diversity was seen for HIV in the same cohort, with only one
major strain predominating (subtype C) (35). The distribu-
tion of the HCV genotypes across different cities suggested
novel introductions of HIV and HCV into PWID populations
consistent with known drug trafficking routes along nearby 
borders. Spatiotemporal phylodynamic analysis of the HCV 
sequences from this study found genetic links within trans- 
mission networks between geographically distant Indian 
cities, indicating that targeted interventions must account for 
intercity travel to be successful (36). 


AGSP has kept pace with the emergence of SARS-CoV-2 
variants during the COVID-19 pandemic by characterizing 
specimens from four continents and testing them in 11 differ- 
ent antigen, molecular, and antibody COVID-19 assays, with 
excellent performance demonstrated due to well-conserved 
assay targets and robust design (37). All major variants 
of concern to date (Alpha, Beta, Gamma, Delta, and Omi- 
cron) have been sequenced in the program, with genomes 
deposited in the Global Initiative on Sharing All Influenza 
Data (GISAID) repository (38). Important insights about the 
pandemic have also been gained from AGSP-derived SARS- 
CoV-2 genomes. Notably, genomes carrying spike protein 
receptor-binding domain (RBD) mutations linked to increased
transmission and vaccine escape in variants of concern were 
found in Senegal during the second wave of the pandemic 
in the winter of 2020-2021 (39). The mutations were found 
in lineages that were not designated as—and never be- 
came—variants of concern, raising questions about the path 
a variant must take to become a global concern. Interestingly,
the Alpha variant of concern was present during the second
wave, but it never became the predominant strain in Senegal 
in contrast to other countries where it rapidly became the pri- 
mary lineage. This may be due to the pre-existing dominance 
of the B.1.1.420 lineage in Senegal at the time that Alpha was 
introduced (40). The B.1.1.420 lineage was imported from
Italy and gained fitness in Senegal before spreading global-
ly during the second wave, making it a lineage to continue
monitoring carefully (40). 

The continual discovery of new strains and deeper insights 
gained from viral sequences in AGSP highlight the impor- 
tance of sustained vigilance in monitoring viral sequences 
through a global approach to keep pace with viral evolu- 
tion. To further improve our coverage of circulating strains, 
techniques that increase sequencing throughput and quality 
should continue to be implemented in as many geographies 
as possible. For instance, advances in improving next-gen- 
eration sequencing (NGS) depth and coverage for low viral 
load specimens have been developed at Abbott through 
enrichment strategies for SARS-CoV-2, HIV, and hepatitis
viruses. Implementation of these techniques at additional
sites will increase the number of genomes generated (41-44).
Expanded application of surveillance methods to passively 
monitor the environment will also improve real-time visibil- 
ity of strains that are circulating in specific areas. Notably, 
recent implementation of wastewater monitoring of SARS- 
CoV-2 sequences has been shown to successfully predict 
surges in cases and could be used to prompt public health 
responses when new variants are detected (45, 46). Howev- 
er, passive surveillance systems do not exclusively monitor 
human pathogens, and environmental specimens cannot
be used to monitor diagnostic assay performance. As more
and more sequencing data is generated through surveillance 
efforts, efficient analysis of the data will continue to pose a 
challenge, and innovative solutions in this area will greatly 
improve surveillance throughput and accessibility. Resources 
that provide up-to-date status reports pulled from surveil- 
lance databases will also become an essential part of the 
suite of monitoring tools used to make sense of the data gen- 
erated. The more that surveillance sequencing data is shared, 
the stronger our defenses will be against emerging strains
of divergent viruses such as HIV, hepatitis, and SARS-CoV-2.
Since new variants of any of these viruses could cause major 
outbreaks and surges globally, improved surveillance of 
these pathogens is a critical pillar of pandemic preparedness. 
As the only diagnostic test manufacturer with such a unique 
long-standing and large-scale surveillance program, Abbott 
provides a vital tool to stay ahead of these dynamic viruses. 

Virus discovery for diagnostic
preparedness

Michael G. Berg*

The Abbott Pandemic Defense Coalition (APDC) is a scientific and
public health partnership whose primary purpose is the early 
detection and mitigation of infectious disease threats of pan- 
demic potential. To this day, a large percentage of presumed 
infectious diseases go undiagnosed. For example, 15%-30% 
of acute respiratory illness, 30%-50% of gastroenteritis, and 
50%-70% of meningitis/encephalitis have no etiology determined (1).
In this context and in contrast to “surveillance” of known pathogens, 
“discovery” refers to detection of novel, emerging viruses that have 
not been previously characterized or associated with disease. Unbiased
next-generation sequencing (NGS) of patient samples has the potential to
dramatically improve these numbers, but there is spirited discussion about its
actual diagnostic relevance and actionability in clinical settings (2-4).

However, the question as to whether NGS and associated
bioinformatic algorithms [sequence-based ultrarapid patho- 
gen identification (SURPI), Kraken, IDseq, among others] 
can be used to discover the presence of and assemble the 
genomes of new viruses is not a matter of debate. In recent 
decades, various new viruses have been discovered by NGS 
with clear links to disease (1, 5). This number is dwarfed by
those described in nonhuman host reservoirs (e.g., bats
and rodents) and arthropods (e.g., ticks and mosquitoes) with
potential for zoonosis or vectoring, respectively (6-12). One
need not even touch a sample: new viruses are being found by
combing through the vast reams of data in the Sequence
Read Archive (13). It is important to appreciate that beyond
determining the sequence, numerous other criteria need to
be established to link a pathogen with disease, including
elicitation of a host response, clustering of cases, and fulfill-
ment of Koch's postulates (14).

During an outbreak, finding other cases and making the 
correct association with disease ceases to be an issue. Yet, 
distributing the technical NGS capability to make that initial 
identification of a new virus and to monitor strain diversity 
following its emergence are areas that we can focus on for 
pandemic preparedness. Whether it’s due to globalization, 
international travel, changes in the environment, or en- 
croachment of humans into natural habitats, pandemic-scale 
viruses such as severe acute respiratory syndrome (SARS), 
Middle East respiratory syndrome (MERS), Zika, Ebola, and 
SARS-CoV-2 have increased in frequency, and it is likely that 
other outbreaks with pandemic potential will emerge again. 
Diagnostics play the most essential role throughout a public 
health response to a pandemic. Quickly designing molecu- 
lar and serologic/rapid tests can be done by any entity, but 
only industry has the resources, technical knowledge, and 
logistic infrastructure to scale up production rapidly enough 
to meet the global demand. It all starts with having a genome 
sequence. For this reason, Abbott's Virus Discovery program 
performs a necessary function within the APDC framework 
to bolster public health efforts (15).


The evolution of virus discovery at Abbott 

The Abbott Diagnostics division was established in 1972 with 
the development and distribution of the AUSRIA-125 blood 
screening test for the hepatitis B virus (HBV) Australian 
antigen. During this period, the search for the elusive non-A, 
non-B hepatitis virus was at its height. Around the same time 
that Chiron Corporation discovered hepatitis C virus (HCV) 
in 1989, Abbott scientists identified another group of flavivi-
ruses that appeared to be associated with hepatitis and other
transfusion-transmitted diseases. During the 1990s, GB virus 
type C (GBV-C) was ultimately found to be ubiquitous in both 
healthy and sick individuals, and so while a new Pegivirus 
species was established, these viruses were not pathogenic 
(16). The pace of discovery was hampered by reliance on 
labor-intensive methods available at the time, such as clon- 
ing and Sanger sequencing, radio-labeling, and virus prop- 
agation in culture. The advent of high-throughput methods 
such as microarrays for viruses (ViroChip) held great promise 
for the virus discovery field. In 2006, studies using the Viro- 
Chip indicated that xenotropic murine leukemia virus-related 
virus (XMRV) was associated with prostate cancer (PC)
(17, 18). A flurry of follow-up reports additionally suggested
a causal role for XMRV in chronic fatigue syndrome (CFS)
(19, 20). Numerous independent studies failed to confirm an
association of XMRV with PC and CFS, and Abbott scientists
created molecular and serologic tools that ultimately helped
to refute these claims (21, 22). While XMRV was determined
to be a real virus arising from recombination of endogenous 
murine proviruses during passage of a human PC tumor in 
immunocompromised mice, the field at large concluded that 
the initial results were due to laboratory contamination with 
infected cell lines (23, 24). Important scientific lessons were 
learned from both “discoveries” and highlighted the many 
pitfalls and challenges associated with this work. Abbott not 
only remained engaged in this space but committed to taking 
the lead. 

At this same time, the high-throughput sequencing rev- 
olution was underway with 454 and Illumina technologies 
quickly eclipsing microarrays. The concept of “metagenomic 
NGS” (mNGS) was born: unbiased sequencing of all nucleic 
acids in a patient sample to detect bacteria, parasites, fungi, 
and viruses, and to potentially assemble the genomes of new 
viruses. Abbott engaged in a unique, long-term partnership 
with the leading experts in this burgeoning field at University 
of California, San Francisco (UCSF) to establish the Viral Di- 
agnostics and Discovery Center (VDDC). Leading the VDDC 
was Charles Chiu, M.D., Ph.D. Among his lab's many founda- 
tional contributions was leveraging mNGS to report the asso- 
ciation of known viruses (e.g., astrovirus) with unexpected dis- 
ease outcomes (e.g., encephalitis) (25). They also discovered 
several novel viruses, including MW polyomavirus in acute 
pediatric diarrhea cases, Bas-Congo rhabdovirus causing 
hemorrhagic fever, and Lone Star bunyavirus in Amblyomma
ticks (26-28). Importantly, this research demonstrated the
promise of mNGS to deliver an actionable diagnosis, saving
a young teenager from a potentially fatal case of neuroleptospirosis (29).
Pivotal to all this work was the development of the SURPI 
data-analysis pipeline (30). Vast amounts of sequences could 
now be aligned to the entire GenBank NT (Nucleotide) data-
base in a few short hours, removing host background DNA/
RNA, identifying known pathogens, and exploring clues as to
the presence of a potential divergent virus. As intended, the
focus then shifted to building NGS and bioinformatics capa-
bility internally at Abbott. A key milestone that evidenced this
transfer of expertise and methodology was the discovery of
human pegivirus-2 (HPgV-2) in 2015 (31). Starting from just
three sequence reads with remote homology to simian pegivirus A,
Chiu’s group identified the index case and assembled the HPgV-2
draft genome. It was Abbott’s team that developed this story further
by identifying another dozen cases of infection, made possible
by our ability to rapidly prototype molecular and serology tools. A
quantitative reverse transcription (RT)-PCR on the Abbott m2000
RealTime System simultaneously detecting HPgV-1 (GBV-C) and
HPgV-2 viral RNA, together with a high-throughput assay on the
ARCHITECT immunoassay analyzer detecting antibodies to envelope
and NS4A/B antigens, yielded important epidemiologic data (32, 33).
We established that HPgV-2 was a rare, chronic bloodborne virus
strongly associated with hepatitis C coinfection and likely transmitted by
intravenous drug use. Concurrent studies from Kapoor et al. and other
subsequent reports around the world corroborated this conclusion
(34-36).
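
The triage described above (subtract host background, flag known pathogens, and set aside reads that match nothing as candidates for a divergent virus) can be sketched in a few lines. This is a toy illustration based on shared k-mers, not SURPI's actual algorithm, which relies on dedicated aligners against the full GenBank NT database; the function names, thresholds, and sequences below are all hypothetical.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all k-length substrings of seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def triage_reads(reads, host_ref, pathogen_refs, k=8, min_shared=0.5):
    """Label each read 'host', a known pathogen name, or 'unclassified'.

    A read is assigned to a reference when at least `min_shared` of its
    k-mers occur in that reference. Host is checked first so that host
    background is subtracted before pathogen matching.
    """
    host_index = kmers(host_ref, k)
    pathogen_index = {name: kmers(seq, k) for name, seq in pathogen_refs.items()}
    labels = []
    for read in reads:
        read_kmers = kmers(read, k)
        if not read_kmers:  # read shorter than k
            labels.append("unclassified")
            continue
        if len(read_kmers & host_index) / len(read_kmers) >= min_shared:
            labels.append("host")
            continue
        best_name, best_frac = "unclassified", 0.0
        for name, index in pathogen_index.items():
            frac = len(read_kmers & index) / len(read_kmers)
            if frac >= min_shared and frac > best_frac:
                best_name, best_frac = name, frac
        labels.append(best_name)
    return labels


# Hypothetical toy references and reads, chosen only so the outcome is easy to follow.
host_ref = "AAAACCCCAAAACCCCAAAACCCC"
pathogen_refs = {"known_virus": "GGGGTTTTGGTTGGGTTTGGGGTT"}
reads = [
    "CCCCAAAACCCC",      # drawn from the host reference
    "TTTTGGTTGGGT",      # drawn from the known virus reference
    "ACGTACGTACGTACGT",  # matches neither: a lead worth following up
]
labels = triage_reads(reads, host_ref, pathogen_refs)
print(Counter(labels))
```

Reads left "unclassified" are the interesting ones for discovery; in practice they would be assembled and compared against protein databases to look for remote homology, as in the HPgV-2 example.
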

Stopping the next pandemic 
requires us to follow as many leads 
as possible. To do this effectively 
demands the building and optimi- 
zation of methods and the devel- 
opment of the next generation 
of virus hunters. Recovering full 
genomes and comprehending viral 
genetic diversity from individuals 
with either low viral loads or limited sample volume necessi-
tated NGS sensitivity improvements. Target capture methods
(xGen) were first applied to HPgV-2 and then developed for
HIV, hepatitis B and D, and SARS-CoV-2 viruses (37-40).
HIV probes proved instrumental in amplifying reads from
a precious remnant sample to determine a third full-length
sequence that finally established HIV-1 subtype L (41). This
sensitive method was exploited once again to identify a high
percentage of HIV elite controllers (defined as HIV+ indi-
viduals maintaining no detectable viral load for 12 months
with no antiviral treatment) in the Democratic Republic
of Congo (42). Hybridization of probes against all human
viruses not only provides a 1000x boost in sensitivity but
also allows high multiplexing and rapid data analysis, as the
myriad human background reads are washed away and
not sequenced (43). This shortcut, in parallel with mNGS for
detection of nonviral and divergent viral reads, is now a pillar
of APDC patient sample characterization.

In addition to NGS optimization, streamlining laboratory 
methods and developing new assays applied either upstream 
or downstream of sequencing has been another key prior- 
ity. This has entailed staffing a deep bench knowledgeable 
and experienced in molecular [e.g., quantitative PCR (qPCR) 
and isothermal amplification] and serologic detection (e.g., 
high-volume, rapid tests). Our biosafety level (BSL)-3 facility, 
used to study high-risk pathogens, has proven indispensable 
in our response to COVID-19 (44, 45). Whether working with 
a known pathogen or culturing newly discovered viruses, this 
BSL-3 lab gives us the ability to respond quickly to shifting 
landscapes (e.g., new SARS variants of concern) and explore 
virus biology to make meaningful scientific contributions 
(46). APDC is equally focused on data-analysis infrastructure.
Our bioinformatics team is building a new metagenomics
and discovery pipeline (DiVir) with a web interface that 
harnesses the power and scalability of cloud computing. 
Phylogenetics expertise is another area we have invested in. 
From rewriting the taxonomic classification of picobirnavi- 
ruses (PBVs) to discerning the origins of SARS-CoV-2 strains 
circulating in Senegal, applying state-of-the-art analyses to 
our data is paramount (47, 48). 

Assembling these various elements has culminated in two 
recent reports highlighting these strengths. A novel PBV was 
discovered in a patient hospitalized in Barranquilla, Colom- 
bia with acute respiratory illness (ARI) (49). Development of 
an RT-qPCR on the m2000 system led to the identification 
and subsequent sequence characterization of 25 additional 
strains from the United States and Colombia. Phylogenetic 
analysis of the RNA-dependent RNA polymerase enzyme 
indicated the emergence of two lineages that clustered 
together as a distinct genotype: One was linked to the index 
case and the other to Asian strains previously associated 
with ARI. A “novel” orthobunyavirus was also found in an 
HBV+ patient in the Democratic Republic of Congo, which 
after 50 years in a freezer, was recently identified by the 
U.S. Centers for Disease Control and Prevention as Bangui 
virus from the Central African Republic (50). While related 
bunyaviruses have been reported to cause disease, RT-qPCR 
screening with a newly developed test failed to identify 
additional cases of infection in Africa. Phylogenetic analysis 
of this and other members of the Anopheles A serogroup 
reveals that these are ancient viruses that infrequently spill over to
humans. These and other viruses we have yet to describe are 
unlikely to cause the next pandemic, but we have the tools in 
place to make these discoveries and develop the diagnostics 
to detect them. 


Virus discovery within the APDC network 

A fundamental promise of APDC is the sharing of resources, 
expertise, and samples so that together we can remain vigi- 
lant while also making new discoveries. To ensure standard- 
ization of data input, we are first defining diagnostic criteria 
for case finding of “illnesses of unknown origin.” Key to this
enrollment process are universal prescreening methods (e.g., 
qPCR) to flag common pathogens involved in acute respira- 
tory or febrile illnesses, to name a few conditions. Abbott is 
also providing protocols and resources for conducting mNGS 
and target enrichment, thereby building sequencing capacity 
in-country. In some cases, specimens are shipped to Abbott 
for processing, while in others, both parties sequence the
same sample to ideally arrive at the same result. Transfer
of panels containing positive controls ensures laboratory
methods are working as expected, yet here again, this is only 
half the equation. Abbott has designed the DiVir pipeline 
with remote, cloud-based data upload and visualization for 
our APDC partners. This will allow end users to analyze
their own data and to be the ones to drive their research
forward. Similarly, once provided with Abbott instrumentation,
these sites will be versed in its operation and ready to test
or develop their own “research use only” (RUO) assays on
these platforms.

Our identification of an outbreak of Oropouche fever in 
Colombia is a prime example of just how these collabora- 
tions can function and provide real public health value. Fever 
clinics at four sites in Colombia screening for dengue, Zika, 
chikungunya, Chagas, and other viruses designated undiag- 
nosed cases for further NGS analysis. An acute Oropouche 
bunyavirus infection was detected, and the full-length 
sequence indicated that the virus had been circulating un- 
detected in Colombia for years after an initial case report in 
2017 (51). Both an RT-qPCR assay on the m2000 system and
immunoglobulin M/immunoglobulin G (IgM/IgG) serologic 
assays on the ARCHITECT platform were rapidly developed 
to screen nearly 1,000 fever samples collected between 2019 
and 2021. We observed an 11% incidence of new infections 
and 4.5% IgM/15.2% IgG antibody positivity. This work high- 
lighted the importance of logistics and the need for infra- 
structure to be in place to enable a rapid response. Whether 
it's sample collection, overseas shipments, instrumentation 
placements, or having sequencing and qPCR/serology 
capabilities at Abbott and abroad, working out these details 
and actively building technical capacity in advance of an 
outbreak makes all the difference for timely reporting and 
preparedness. 


Preparedness for a rapid, informed response 
Diagnostics are unquestionably at the center of any pan- 
demic response strategy: Every aspect of clinical manage- 
ment and a public health response begins with a positive or 
negative test. Whether tallying up new cases to determine 
infection rates, monitoring how long a patient is shedding 
infectious virus to determine quarantine lengths, or looking at 
induction and maintenance of antibody responses following 
immunization and natural exposure, a diagnostic is the start- 
ing point. Indeed, tracking new variants through sequencing 
and sophisticated phylogenetics requires first identifying
a positive sample. For SARS-CoV-2, Abbott correlated the
PCR cycle threshold (Ct, the number of cycles required to
detect the target above the background) with virus growth
in culture to indicate the viral load likely required for
transmission and hence the sensitivity demanded of a rapid test like
[…] and our partners dissected discordant molecular and rapid
(Panbio) test results to establish the window in which a person
is infectious (52). These types of studies lead to an appropriate
interpretation of multiple data points and an informed public health
response. While proof-of-principle tests and academia-led
initiatives are admirable and needed, the ability of industry to
rapidly deliver reliable products in mass quantities that adhere
to government quality standards and specifications (e.g., U.S.
Food and Drug Administration approval and European CE
marking, among others) is frankly what the world expects and
exactly what Abbott delivered during the COVID pandemic.

Setting up the Virus Discovery program across APDC sites is
intended to serve as a “sentinel system.” This research is often
referred to as “looking for the needle in the haystack,” but
in another way, it’s more like playing the lottery: You have to
buy a ticket to win. Our chance of success increases with each
additional site exploring more patient samples, applying the
most sensitive tools available, and following a well-thought-out
strategy. If a new threat emerges, our network sites function
within or hand-in-hand with hospitals that have access
to patients. Regardless of whether APDC members make the
initial discovery or read about the next potential pandemic in
the news, we are poised to act together immediately. Methods,
training, instrumentation, and funding will already be in
place to sequence and comprehend genetic diversity; design
and produce diagnostic tests; and understand the epidemiology.
With testing distributed across sites, RUO assays can
quickly progress to scaled-up versions with diagnostic
authorization. APDC, at its core, is a network of action. Abbott
is not here to be a funder, but rather an engaged, equal
partner: We are making an investment and doing the hard
work, the results of which are already yielding benefits.

“In the first year as an APDC partner, we've worked with Abbott to
phylogenetically characterize over 1,000 SARS-CoV-2 sequences to
understand how the Mu variant overtook Gamma, while also screening
over 1,000 patients with fever of unknown origin. In addition to our
work, other sites are also actively publishing research, identifying
outbreaks, and developing diagnostic tools. By copublishing and sharing
patient samples, staff, and technology, we are able to work together more
collaboratively and efficiently, resulting in a greater mutual benefit to
our members and to public health.”
Jorge Osorio, codirector of the Colombia-Wisconsin One Health
Consortium and president and CEO of VaxThera

The key role of field epidemiology in preventing the next
pandemic: The partnership between Abbott and Training
Programs in Epidemiology and Interventions Network
(TEPHINET)

Francisco Averhoff1, Lara Pereira2, Susan Gawel1

On March 11, 2020, prompted by the detection of over
118,000 cases, 4,000 deaths, and alarming levels of 
spread and severity in more than 100 countries and 
territories, the World Health Organization (WHO) 
declared the novel coronavirus 2019 (COVID-19) 
outbreak a global pandemic (1). After that pivotal
announcement, the absence of a coordinated global pre-
paredness plan was evident, highlighting the need to focus
on preparedness for the next pandemic. The
COVID-19 pandemic also underscored the critical need for 
disease surveillance as well as the importance of trained 
frontline public health professionals, such as field epidemi- 
ologists and clinicians, who can recognize unusual medical 
events or clustering of medical conditions or syndromes.
Further, the global spread and impact of the SARS-CoV-2 
virus demonstrated that coordination and collaboration 
among public health networks regionally and worldwide are 
essential to ensure a rapid and robust response to infectious 
disease threats. Abbott, and the Abbott Pandemic Defense 
Coalition (APDC), recognized the need to partner with and 
support epidemiologic programs as key in early detection of 
and response to pandemic threats. 

In the 1980s, several countries with funding and techni- 
cal support from the U.S. Centers for Disease Control and 
Prevention (CDC) and other partners developed field-based 
training programs in applied epidemiology in response to 
the need to strengthen their local and national public health 
capacity and infrastructure. The benefits resulting from these 
programs working together prompted the organization of
a global network and, with support from WHO, CDC, and
Fondation Mérieux, this network was formalized in 1997 as
the Training Programs in Epidemiology and Public Health
Interventions Network (TEPHINET). TEPHINET is a program
of The Task Force for Global Health, originally founded as the
Task Force for Child Survival by Dr. Bill Foege in 1984 (2).
TEPHINET has grown into a global network of 75 Field
Epidemiology Training Programs (FETPs) in more than 100
countries (see Figure 1). TEPHINET partners with local and
national government ministries, academia, nonprofits, non-
governmental organizations, and now, through this partner-
ship with Abbott and the APDC, with the private sector.

FIGURE 1. Countries with Training Programs in Epidemiology and Public Health Interventions Network (TEPHINET)-affiliated Field Epidemiology Training Programs (FETPs) (n=75), September 2021.

The TEPHINET/FETP model, “training through service,”
embraces a culture of shared expertise and resources,
which facilitates the program's alignment with regional and 
in-country public health needs to ensure sustainable impact. 
Training of FETP Fellows is achieved through a three-tiered 
approach that addresses frontline (local), intermediate 
(state), and advanced (national and academic) training 
needs, mirroring the public health infrastructure in many 
countries. The program confers either a certificate of comple- 
tion or a Master of Public Health (MPH) degree, depending 
on the training level and in-country university collaborations. 
TEPHINET supports core competency domains that encompass
both infectious and noncommunicable diseases,
including surveillance, outbreak investigations, epidemiologic
methods, scientific communication, and mentorship. Key to 
TEPHINET's mission is the development and dissemination 
of learning resources for both trainers and trainees, which in- 
clude curriculum, knowledge hubs for high-priority topics, an 
abstract database, tools for virtual and in-person instruction, 
and hosting of emergency response trainings. Additionally, 
TEPHINET accreditation of FETPs ensures standardization 
of quality training and increased recognition of the FETPs’ 
value in addressing country-specific health priorities. 
TEPHINET has trained more than 19,000 public health 
professionals across the world. FETP trainees and alumni 
serve as “boots on the ground” in the fight for better health 
and have led, developed, and/or implemented disease surveillance
systems, outbreak investigations, and mitigation
of acute health events, including playing vital roles in the
COVID-19 response in many countries (3, 4). 

While the timing, location, and severity of the next pan- 
demic cannot be predicted, it will be frontline field epi- 
demiologists with expertise in surveillance and outbreak 
investigations who will serve as the eyes and ears to identify 
and define the clinical, laboratory, and epidemiologic charac- 
teristics of the next emerging pathogen of pandemic poten- 
tial, and, critically, alert and inform the world of the potential 
threat. Epidemiologists working seamlessly with microbiologists
and virologists will be the backbone of early detection
and control of future pandemic threats. This need gave
impetus in 2021 to APDC and TEPHINET entering a first-of- 
its-kind, industry-public health partnership to support and 
strengthen epidemiologic capacity in pandemic prepared- 
ness in low- and middle-income countries. The APDC-FETP 
Fellowship, launched in 2021, provides funding, mentorship, 
and networking opportunities to TEPHINET-member FETP 
trainees and graduates to conduct innovative projects in 
pathogen discovery, surveillance, and other priority public 
health focus areas in collaboration with APDC partners in 
their home country (5). During the first year of the partnership
in 2021, eight projects selected from 72 competitive applications
were launched in seven countries in South America, Africa,
South Asia, and the Caucasus (see Table 1). These projects
are primarily focused on developing the infrastructure
needed to establish systems for early detection of pandemic 
threats and/or studying infectious diseases of public health 
importance in the host country. Fellows are awarded a small 
grant to fund project implementation. Each fellow is required 
to identify a local mentor and institution(s) that will host and/ 
or support implementation of their proposed project while 
also ensuring that their government ministries are aware
of and amenable to the institution's participation or collaboration
in the project. In addition, an international subject
matter expert in either epidemiology or infectious diseases is 
recruited to complement and strengthen the fellow’s mentor- 
ship experience. As of this writing, the projects are ongoing, 
with a completion date in late 2022. 

Since FETPs are often associated with their countries'
Ministries of Health and/or National Public Health Institutes,
an additional benefit of the APDC-FETP Fellowship is
the bringing together of public health institutions with APDC 
partners in the country or region; this would provide govern- 
ment entities potential access to partners, the APDC, and 
Abbott, which have substantial capacity in pathogen discov- 
ery. These linkages may foster public-private partnerships 
that will be critical for timely response to future pandemic 
threats by strengthening collaborative laboratory and epide- 
miologic relationships nationally, regionally, and globally.

TABLE 1. Training in Epidemiology and Public Health Interventions Network (TEPHINET)-Abbott/Abbott Pandemic Defense Coalition (APDC) Fellowship Awards, 2021-2022.

Country       Project
Colombia      Surveillance of respiratory agents in acute respiratory infections in Antioquia, Colombia
Georgia       (a) Association of hepatocellular carcinoma (HCC) and hepatitis C virus (HCV) in Georgia; (b) establishment of pathogen discovery surveillance in Georgia
Brazil        Respiratory and arbovirus surveillance in slum, Brasilia
Tanzania      Re-establish noninfluenza respiratory surveillance, Tanzania
Bangladesh    Establishing laboratory capacity for West Nile virus (WNV) in acute febrile illness (AFI) and acute encephalitis syndrome surveillance systems, Bangladesh
Nigeria       Piloting the establishment of a dengue fever surveillance system in Bayelsa State, Nigeria
Nigeria       Meningococcal meningitis: Genomic sequencing of meningococcus serogroup X in the meningitis outbreak, Northern Nigeria
Uganda        Establishing sentinel surveillance for blackwater fever (BWF) in Uganda

The case for networking the networks


Gavin Cloherty, Nicaise Ndembi, Ian Lipkin, Sandra Chavez


Detection and epidemiological characterization of
infectious disease outbreaks are key for early identification
of and response to potential pandemic threats.
The International Health Regulations (IHR) of 2005 
committed all member states of the United Nations to 
developing the capacity to detect infectious diseases 
and to report those diseases in a transparent manner. Fifteen 
years later, the goals of the IHR had not been achieved. 

The rapid, global spread of severe acute respiratory 
syndrome coronavirus 2 (SARS-CoV-2) in 2020 highlighted 
the gaps in the early-warning and global-response systems 
that were in place prior to the pandemic. One gap in the 
world’s ability to rapidly respond was a lack of coordination 
on critical aspects of infectious disease surveillance and 
outbreak response. The 100 Days Mission to Respond to
Future Pandemic Threats (1), a goal established by the Group
of Seven (G7) to harness public-private collaboration and
scientific innovation to deploy diagnostics, therapeutics, and
vaccines within the first 100 days of a pandemic's identification,
recognized that we have seen tremendous innovations
from partnerships between academia, industry, international 
organizations, philanthropy, and governments responding 
to the SARS-CoV-2 pandemic. However, industry research
and collaboration played no active part in
prepandemic public health initiatives. To avoid repeating
history, we must learn from current and previous pandemics. 
The world is currently making significant investments in 
pandemic preparedness. In recent months, numerous pan- 
demic preparedness initiatives have been launched around 
the world; some overlap in scope but vary by geography or 
specific area of focus, while others are tackling different yet 
related aspects of pandemic preparedness. These initiatives,
if operating independently, will share the same fundamental lack
of integration that was the Achilles heel of previous efforts,
with the same consequences: duplication of effort, wasted
resources, a lack of visibility into emerging threats, and a
fragmented response that hampers the development of tools needed
to respond quickly. If we can find a way to connect the different programs,
including initiatives led by industry, the world will realize
the full potential of this historic moment.


Programs and frameworks for pandemic preparedness

In the United States, the Pandemic Innovation Task Force 
within the White House Office of Science and Technology 
Policy (OSTP) was established to develop new countermea- 
sures against possible future COVID-19 variants and other 
pandemic threats. The task force is charged with addressing 
all aspects of pandemic preparedness, including vaccination,
therapeutics, diagnostics, and other interventions to
be used in response to emergent variants or other public
health threats. Programs dedicated to
preparing for and responding to future pandemics are critical
if we are to avoid the devastating impact that responses to
emerging threats have had on treatment of already existing
epidemics and pandemics. These include diseases such as
HIV and syphilis, which have increased sharply due in no small
part to the tremendous strain the SARS-CoV-2 pandemic has 
placed on a stretched and neglected public health system. 

Several public health programs have already been established,
such as the National Institute of Allergy and Infectious
Diseases Centers for Research in Emerging Infectious Diseases
(CREID), part of the U.S. National Institutes of Health (2).
CREID plans to develop a framework and the infrastructure 
necessary to respond quickly and effectively to future out- 
breaks by building a multidisciplinary network of teams of in- 
vestigators that will conduct pathogen/host surveillance and 
study pathogen transmission, pathogenesis, and immuno- 
logic responses in the host. This program is intended to help 
develop capacity for emerging infectious disease research in 
high-risk and underserved regions around the world. 

The U.S. Centers for Disease Control and Prevention (CDC) 
(3) has been at the forefront of pandemic preparedness and 
response globally for decades. The CDC's focus was original- 
ly on pandemic influenza. Beginning early in the millennium 
following the 1997 emergence of avian influenza (H5N1) in
Hong Kong, CDC, while maintaining vigilance for pandem- 
ic influenza, has been a leader in responding to the HIV 
pandemic through the President's Emergency Plan for AIDS 
Relief (PEPFAR) (4), and to the Ebola outbreaks in Africa and 
the Zika outbreaks in South America, among others. CDC 
has pathogen-specific expertise in epidemiology, medicine,
and diagnostics. It also provides leadership in systems for
detection and response to emerging threats through the 
establishment of the Center for Preparedness and Response 
and the Division of Global Health Protection; these programs 
address not only infectious disease threats but also natural 
and manmade disasters. Virtually all of CDC's infectious 
disease and preparedness programs were mobilized in response
to the emergence of SARS-CoV-2 in 2019.

In a similar vein, an academic-led initiative out of Columbia
University, the Global Alliance for Preventing Pandemics
(GAPP) (5), builds on an infrastructure developed
over decades of research and service. GAPP integrates the 
resources and insights of well-established subject matter 
experts as well as academic and public health institutions 
to address the urgent challenge of emerging infectious 
diseases. This alliance has partners in regions where pan- 
demic threats emerge and re-emerge due to anthropogenic 
factors such as climate change, deforestation, displacement 
of wildlife, bushmeat hunting and consumption, wet mar- 
kets, urbanization and migration, and incursion into wildlife 
habitats. It aims to develop a model that realizes and extends 
the goals of the IHR by providing inexpensive, rapid tools for 
diagnosis, discovery, and surveillance of infectious diseases. 
GAPP identifies and prioritizes infectious agents based on 
pandemic risk and shares data in a manner that provides the 
foundation for producing drugs and vaccines to prevent and 
treat infection. 

Outside the United States, as part of its vision to integrate 
pathogen genomics into public health surveillance and 
disease-outbreak response, the Africa Centres for Disease 
Control and Prevention (Africa CDC) (6), through the Africa 
Pathogen Genomics Initiative (Africa PGI), is collaborating 
with partners to strengthen laboratory networks and surveillance
systems on the continent. The key components of Africa
PGI include (1) strengthening the Africa CDC Institute of
Pathogen Genomics; (2) building a continent-wide network 
of pathogen genomics and bioinformatics competency; (3) 
establishing a next-generation sequencing (NGS) Academy; 
(4) creating a data architecture that will include interoperable 
software and a cloud-based data repository; and (5) develop- 
ing high-impact use cases for pathogen genomics. Africa PGI 
has generated over 90,000 sequences from 53 African Union 
member states (except Eritrea and the Saharawi Republic). 
Thirty-six member states have referred 29,000 specimens 
to Africa PGI since January 2021. A notable success of this 
program was the identification and characterization of the
SARS-CoV-2 Omicron variant by South African scientists at
the Center for Epidemic Response and Innovation, a member
of Africa PGI and the Abbott Pandemic Defense Coalition 
(APDC). 

An example of public-private partnership is the French- 
based Global Influenza Hospital Surveillance Network 
(GIHSN) (7). GIHSN is a platform that collects standardized 
data from hospitalized patients with severe acute respira- 
tory illnesses across 22 countries, including low-, middle-, 
and high-income areas from both hemispheres. Clinical and 
epidemiological data are integrated with virological data, 
including whole-genomic sequencing information, which are 
shared with public health authorities at the local, regional, 
and global level. The network relies on experienced re- 
searchers and virologists from various countries and has an 
independent scientific committee as an advisory board. Part 
of the funding comes from industry, including Sanofi Vaccines,
Seqirus, Illumina, and Abbott. GIHSN can inform policy
decisions and serve as a network for improved respiratory 
virus surveillance capacity building and response. In addition 
to its important role in influenza surveillance, this network is 
positioned to have a major impact on noninfluenza respirato- 
ry pathogen surveillance. By leveraging this existing network 
and infrastructure, noninfluenza severe acute respiratory 
illness could be investigated to identify novel pathogens that 
may have pandemic potential. 


Building and sharing disease data intelligence 
Apart from the initiatives and programs outlined above, 
others seek to leverage the vast amount of data being gen- 
erated around the world every day to help predict and model 
potential outbreaks with pandemic potential. This includes 
CDC, which recently announced the opening of a new center 
designed to advance the use of forecasting and outbreak 
analytics in public health decision making. The Center for 
Forecasting and Outbreak Analytics was launched in April 
2022 to bring together next-generation public health data, 
expert disease modelers, public health emergency respond- 
ers, and high-quality communications to meet the needs of 
decision makers. The new center will accelerate access to 
and use of data for public health decision makers who need 
information to mitigate the effects of disease threats, such as 
social and economic disruption. 

Globally, the World Health Organization (WHO) has 
launched the Hub for Pandemic and Epidemic Intelligence 
(8). This hub is part of WHO's Health Emergencies Programme
and will be a new global collaboration of countries
and partners, driving innovation to increase the availability of 
key data, develop state-of-the-art analytic tools and predic- 
tive models for risk analysis, and link communities of practice 
around the world. Critically, it will support the work of public 
health experts and policymakers in all countries with the 
tools needed to forecast, detect, and assess epidemic and 
pandemic risks so they can make rapid decisions to prevent 
and respond to impending public health emergencies. The 
Global Outbreak Alert and Response Network (GOARN) (9) 
is another WHO-sanctioned network comprising technical 
and public health institutions, laboratories, nongovernmental 
organizations, and other organizations that work to detect, 
investigate, and respond to outbreaks, and report on poten- 
tial pandemic threats. 

The United Kingdom is working with the WHO to launch a 
“global pandemic radar” to identify emerging COVID-19 vari- 
ants and track new diseases around the world. The pathogen 
surveillance network is intended to save lives and protect 
health systems by spotting diseases before they cause future 
pandemics. 

The Rockefeller Foundation launched the Pandemic 
Prevention Institute (PPI) (10), a collaborative organization
with a global network of partners committed to building data 
sets and analytics needed to detect, mitigate, and prevent 
pandemics. This institute plans to work with others, such as 
the WHO's Hub for Pandemic and Epidemic Intelligence and 
the UK's Global Pandemic Radar. PPI is also accelerating 
work to develop analytic tools and algorithms that detect 
early-warning signals wherever they occur; its goal is to
design technology that detects the signs of potential
outbreaks and supports sharing of those signals.


Integrating industry into global public health initiatives
One element that is often neglected is the role industry 
could play if engaged in active collaboration—beyond the 
traditional funder/donor relationship—with public health 
and academic initiatives. One example of this potential is the 
APDC. Launched in early 2021, APDC is a natural extension 
of Abbott's work in diagnostics, virus discovery, and virus 
surveillance over the past three decades. 

In 1994, the Abbott Global Surveillance Program was 
launched and continues to be active today. This program 
is dedicated to tracking the evolution of HIV, viral hepati- 
tis, SARS-CoV-2, and other pathogens around the world.
In 2009, Abbott partnered with the University of California,
San Francisco, to create the first-of-its-kind, nonprofit viral 
diagnostics center, known as the Viral Diagnostics and 
Discovery Center (VDDC), to identify unknown viruses that 
cause acute and chronic human illnesses, including out- 
breaks and rare and unexplained diseases. Building on this 
experience and expertise, APDC is another first-of-its-kind, 
industry-led coalition with global, multisector scientific and 
public health partnerships, whose primary objective is the 
early detection and mitigation of infectious disease threats 
of pandemic potential. As of this writing, the network has 
partners on five continents, including academic institutions, 
and governmental and nongovernmental organizations. One 
insight that became clear during the SARS-CoV-2 pandemic 
is that, for diagnostics, one size does not fit all; we need a 
variety of tools to effectively protect public health, from sim- 
ple, decentralized or at-home rapid tests, to highly sensitive, 
complex, and automated laboratory-based tests. No one 
knows what type of diagnostic tests will be needed to pre- 
vent the next pandemic. A novel element of APDC is the ca- 
pacity for early development and rapid deployment of quality 
diagnostics. Additionally, centralized and decentralized rapid 
tests targeting newly identified pathogens of pandemic 
potential can be quickly and seamlessly produced at scale 
to respond to an emerging threat, at least until vaccines and 
therapeutics are available as the best defense. 

Combining industry resources with traditional epidemi- 
ological disease surveillance tools, pathogen monitoring 
in animals, genomic sequencing, and artificial intelligence 
is vital for the creation of a global disease-warning system 
capable of speeding up identification of the next pandemic 
threat and alerting scientists and developers to develop an 
effective response. 


Optimizing and connecting global efforts 
How can we optimize the synergies of these different initia- 
tives that share a common vision and purpose into an effi- 
cient, effective collaboration to prevent the next pandemic? 
At the minimum, there needs to be open, regular communi- 
cation between groups involved in pandemic preparedness. 
Sharing of information in the traditional way, through scien- 
tific literature or congresses, is not fast or nimble enough to 
meet this need. Preprint servers helped fill the gap during the 
pandemic; however, as submissions are neither peer re- 
viewed nor curated, these servers provided both information 
and misinformation. We can do better. 


Although many networks already share partner sites, each 
network should have a representative sitting on the steering 
committee of the other’s network, so there is visibility not 
only of what is being done but also what is being planned. 
Information shared should include geographies covered by 
networks, including the sites involved and infrastructure and 
expertise in place as well as the specific scope and focus 
of the work being conducted. In this context, scientific data 
such as interesting cases or outbreaks—ideally including 
sequence, clinical, and metadata—could also be rapidly 
disseminated throughout a “network of networks.” The rapid 
sharing of this type of information, especially together with 
samples, would facilitate rapid production of diagnostic tests 
that could be distributed throughout the greater network of 
networks to expedite further investigation and/or response. 

Governments, philanthropic institutes, and industry are 
making record investments in public health initiatives aimed 
at pandemic preparedness. We must nonetheless under- 
stand that these resources are finite and subject to “panic 
and neglect” cycles of funding. A coordinated effort that 
leverages these assets would reduce waste and increase the 
impact of the investment. To demonstrate to funding agen- 
cies that these initiatives are sustainable and worthy of long- 
term support, we should also emphasize their public health 
benefits between pandemics. We can keep these networks 
dynamic by extending surveillance to antimicrobial resis- 
tance and food security and by investing in interventions that 
reduce the morbidity and mortality of infections that result in 
chronic and acute diseases.

Building genomic sequencing capacity in Africa to
respond to the SARS-CoV-2 pandemic

Genomics surveillance aims to transform public health
interventions by monitoring genetic changes that 
impact pathogenicity, diagnostics, therapeutics, and 
vaccines. Monitoring the genetic changes in SARS- 
CoV-2 has played an important role in shaping the 
scientific response to the pandemic and allowed the 
identification of several variants of interest (VOI) and five 
variants of concern (VOC) to date. Although Africa ac- 
counts for only about 2.5% and 4.1% of the world’s reported 
COVID-19 cases and deaths, respectively (1, 2), two of the
VOC were identified by scientists from South Africa. Here, 
we reflect on some of the investments and capacity development
initiatives that have resulted in exponential growth
in genomic sequencing capabilities across the continent over
the past 2 years. 

Early in the SARS-CoV-2 pandemic, genomic surveillance 
was available in just a few African countries with only 5,245 
SARS-CoV-2 genome sequences being made publicly avail- 
able in 2020 (3). In 2020 and 2021, significant investments in 
equipment and training were made to extend the geographic 
coverage of sequencing within many laboratories in Africa, 
thus increasing surveillance capacity on the continent. These 
investments resulted in an exponential increase in the num- 
ber of SARS-CoV-2 genome sequences produced (Figure 
1). Interestingly, it took 375 days to produce the first 10,000 
SARS-CoV-2 genomes, 87 days to produce the next 10,000, 
and just 24 days to produce the most recent 10,000 genomes. 
To date, almost 100,000 SARS-CoV-2 genome sequences 
from Africa have been shared, and 54 African countries are 
now contributing to SARS-CoV-2 genome sequencing. 
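
The acceleration described above can be made concrete with a little arithmetic. The short Python sketch below converts the three batch durations quoted in the text into approximate daily sequencing rates; the variable and function names are illustrative only, and the calculation assumes output was spread evenly across each period, which the source does not state.

```python
# Days taken to produce each successive batch of 10,000 SARS-CoV-2 genomes,
# as quoted in the text. Even output within each period is an assumption.
BATCH_SIZE = 10_000
days_per_batch = {
    "first 10,000": 375,
    "next 10,000": 87,
    "most recent 10,000": 24,
}


def genomes_per_day(batch_size: int, days: int) -> float:
    """Average number of genomes produced per day over one batch."""
    return batch_size / days


# Average daily rate for each batch.
rates = {
    label: genomes_per_day(BATCH_SIZE, days)
    for label, days in days_per_batch.items()
}

for label, rate in rates.items():
    print(f"{label}: about {rate:.0f} genomes per day")

# Ratio of the latest rate to the earliest rate.
speedup = rates["most recent 10,000"] / rates["first 10,000"]
print(f"Speed-up from first to most recent batch: about {speedup:.1f}x")
```

Under that even-output assumption, the average rate rises from roughly 27 genomes per day for the first batch to more than 400 per day for the most recent one.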

FIGURE 1. Increase in the number of SARS-CoV-2 sequences (solid red line) and African countries (blue circles) contributing genomic sequences to the Global Initiative on Sharing All Influenza Data (GISAID) between January 2020 and March 2022. Note that countries contributing to genomic sequences include 52 African countries and two overseas territories (Reunion and Mayotte).

In 2020, the World Health Organization (WHO) and the Africa
Centres for Disease Control and Prevention (Africa CDC)
launched the Pathogen Genomics Initiative (PGI), a network
of laboratories established to reinforce SARS-CoV-2 genome 
sequencing in Africa. Through this initiative and as part of a 
training program funded by the Rockefeller Foundation and 
Africa CDC, the Centre for Epidemic Response and Innova- 
tion (CERI) and KwaZulu-Natal Research Innovation and Se- 
quencing Platform (KRISP) laboratories in South Africa have 
provided hands-on genome sequencing and analysis training 
to 46 fellows from 21 African countries (Figure 2). Importantly, 
this training has helped build local sequencing and bioin- 
formatics capacity in several of the African countries that 
were previously sending samples to overseas laboratories for 
sequencing. In collaboration with the Africa CDC, 18 webi- 
nar sessions, which reached 123 African groups, have been 
hosted to train African researchers in the analysis of their 
own sequencing data. The first pan-African paper resulting 
from this collaboration was published in Science in 2021 (4), 
and another publication that highlights the 100,000 genomes 
generated in Africa is in preparation. An important element 
of the training is the transparent and open sharing of knowledge.
To this end, the protocols established for genomic data
generation have been made publicly available on protocols.io,
and all of the data analysis scripts have been shared
on GitHub. Future training events are planned through the
Rockefeller Foundation and the Abbott Pandemic Defense
Coalition, which is dedicated to the early detection of, and
rapid response to, future pandemic threats.

FIGURE 2. Hands-on genomics sequencing training at the Centre for Epidemic Response and Innovation in Stellenbosch, South Africa.

The continued generation, analysis, and sharing of virus
genomes from Africa in real time remain important for
monitoring the expected efficacy and sensitivity of different
vaccines and nucleic acid tests across the continent. Africa's 
increased capacity to not only sequence viruses but also 
to analyze the data will surely have a multiplier effect and 
will be beneficial as the continent continues to grapple with 
ongoing and emergent pathogens in the future.